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(57) ABSTRACT 

The present invention presents a system and method of 
providing information retrieved from a server from across a 
communication network that enables archiving services that 
do not interfere with existing components and protocols. The 
services enable users to retrieve and/or search for old 
information, even after such information has evolved or 
disappeared from the original server. The network resource 
naming (e.g. URL) format is extended to include archive 
directives that are intercepted and performed by a proxy 
server. The proxy interprets the archive directive and 
executes the specified archival command: e.g., adding the 
information to a storage repository, searching and retrieving 
the information from the storage repository, scheduling 
automatic archiving of specified server information, trans- 
parent archiving of information that is accessed by the client 
or of the client's cache. The information can be easily 
indexed by a timestamp. Multiple proxy servers can col- 
laborate permitting information to be archived in a distrib- 
uted fashion. Embodiments of the present invention advan- 
tageously do not require changes to client or server software 
or communication protocols. 

10 Claims, 16 Drawing Sheets 
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FIG. 2 
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FIG. 3 



System Command 


Desaiplion 


reset & resetall 


Reset the system. The system will reload the externa! 
configuration file and reset to the initial state. 


connect=[On|Off] 


Run the system as on-line mode (On) or off-line mode 
(Off). For the off-line mode, the system won't create 
'Agent Thread" for retrieving web pages. Instead, it 
responds to the requests with local cache files and 
irvic iKnQp rpniiAQte that nannfrf find nanfis in the 
cache. 


setnet=nelid 


Switch the network to a new one "netid". This feature 
is used when the user is using more than one dial -in 
network for Internet access. If netid does not match 
the id in the control file, the system will call a CGI -bin 
to dial -in a different network. 


dns=dns_server 


Create a DNS agent on the local host It then forwards 
local DNS requests to dns server. 


fofword=loaciport ) rhostrpoft[ t proxyhost: 
port] 


Create a TCP forwarding service. Connection to 
tocalport will be mapped to rhostrport, and traffic to 
the former will be forwarded to the latter. This 
command uses "SSL proxy" protocol to connect to 
proxyhostport 


pasv=rhost:rport 


Create a passive channel to another server (running 
on rhostrport). 


htwa!k= level[, -optionJ*[function[=args]J* 


level: specify the depth of hyper reference hierarchy 
the system will walk down, 
option: options include: (a). Retrieving image files 
(image) or not (noimaoe); (b). Accessing only local 
pages (load) or any We6 pages (global), 
function: the javabin function that is called for each 

visited page. 
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FIG. 4 
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1 level 



Option: | loco! M [With image |V 
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• Press 1 Archive 1 to put walked poges into archive. 

• Press | Search | to search | | keyword. 



• Press | ToGo | to put walked poges into l nnypoge [ category. 
♦ Change Cache Policy 



URL pattern: lhtlp://interoctive, wsi.com/poges/techmari 



Cache Flag: 1 On lv 



• Press | OK | to add a coch flag entry. 
• Chonge Cache VolidTime 
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FIG. 6 
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FIG. 7A 

public class inputFiiterSomple extends javobin ) 

public Object Execute(OutputStream os,Hashtoble ht)throws lOException \ 
String url = (String)ht.get("url n ); 
Siring (name = (String)ht.get("path M ); 
byte orig_date[] = readCacheFile(fname); 
byle result_data[] = dolnputFilter{orig_dato); 
writeCacheFiIe(fnome, result_data); 

1 

i " 



FIG. 7B 

public class outputFilterSample extends javabin \ 

public Object Execute(OutpufStream os.Hashtable ht)throws lOException j 
String url = (String)ht.get("url"); 
String fname = (String)ht,get("path"); 
byte cached_dote[] = readCacheFile(fname); 
byte result_data[] = do0utputFii1er(orig_data); 
os.write(result_data); 

\ 

\ 



FIG. 7C 

public class heoderFilterSample extends javabin [ 

public Object Execute(OutputStream os.Hashtable ht)throws lOException [ 
String orig.header = (String)ht.get("M.header w ); 
String result_header = doHeaderFilter(orig_header); 
os,write(result_header-getBytes()); 

i 

i 



02/02/2004, EAST Version: 1.4.1 



U.S. Patent Sep. 23, 2003 Sheet 8 of 16 US 6,625,624 Bl 



FIG. 8 



Statement 


Description 




Null statement, used as comment, 


n :ifexp°..":elselfexp".„ ":else" ":endif M 


Conditional statement. The format of "exp n is 
rt sfrl »5tr9* nr "<?trf1 =str?" 

Oil 1 OllC Ul allll Ollt » 


"reset" 


Reload configuration and restart the proxy 
services. 


":htwalk uri argument" 


Execute an HTML set walkinq Immediately. 


'^forward Ipoiirhostxporf" 


Create a TCP forwardina channel 


";pasv host:port" 


Connect another proxy server and build a 
passive channel. 


.DiQCR wass_ nairie argils i ... .enuuiuuv 


Invoke java bin. class name (with statements.) 


":javabin class name arglist" 


Invoke a javabin.class name. 


".zip file filename"& ":unzip dump filename" 


Compress/decompress an file. 


;ctrlmacth var cW-type" ... M :endctrl" 


Match the entries with ctrljype in ctrlfile.txt and 
store in ${var.id}, ${var.type), $(var.arg), and 
${var.url) variables. 


:ctriadd ctrljype Ctrl arg ctri_urT 


Add a new entry of ctrlfile.txt 


:ctrldelete selected 


Delete the entries whose Ctrl id is assigned in 
arguments. 


:cronmatch var"... M :endcron" 


Match the entries of scheduling and store in 
${var.id) ( ${var.hour), $(var.wday) t Sfvar.cmd), 
${var.url), and $(var.arg) variables. 


xronadd hour wday cmd urt walk level arglist 


Add a new entry of scheduling. 


"xrondelete selected" 


Delete the entries whose cronjd is assigned in 
arguments. 


isysinfo var 

(i forward liagent|i htwalk|th re ads)" ... 
:endsys" 


List the system information. 
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FIG. 1 1 

public class htwalkGrepSample exents jovabin { 
public void initProcQ { 
// init process 

} 

public void HnalProc() { 
// final process 

public Object Execute (OutputStream os.Hoshtoble ht)throws lOException \ 

initProcQ; // init process 
String hlcmd = (S1ring)ht.get("htcmd"); 
String url = (Stnng)h1.get("url"); 
String orgs = (Sirin g)ht.get("args H ); 
String fname = (String)ht.get( "path"); 
String key = (String)hi.get("walkopf ); 

finalProc(); // final process 

if (keywordMatched(fnarne, key)) { 

ShowGrepResutt(url, fname); 

j 

i 
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FIG. 14 
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INFORMATION ACCESS SYSTEM AND 
METHOD FOR ARCHIVING WEB PAGES 

CROSS REFERENCE TO RELATED 
APPLICATIONS 

This application claims priority to Provisional Applica- 
tion Serial No. 60/118,367, "A Proxy-Based Personal 
Portal/' filed on Feb. 3, 1999, and to Provisional Application 
Serial No. 60/118,651, "A Proxy-Based Personal Portal," 
filed on Feb. 4, 1999, and to Provisional Application Serial 
No. 60/126,705, "Archiving Web Pages with iProxy," filed 
on Mar. 29, 1999, and to Provisional Application Serial No. 
60/133345, "Proxy-Based Services/' filed on May 10, 1999, 
the contents of which are incorporated by reference herein. 

This application is also related to the Utility Patent 
Application, "Information Access System And Method," 
Ser. No. 08/994,600, filed on Dec. 19, 1997, now 
abandoned, the contents of which are incorporated by ref- 
erence herein. 

FIELD OF THE INVENTION 

The present invention relates generally to inform atioo 
access systems, and, more particularly, to information access 
systems used to retrieve information from across a commu- 
nication network. 

BACKGROUND OF THE INVENTION 

As more resources become available on communication 
networks such as the Internet, it has become increasingly 
more difficult to locate, manage,' and integrate them. Many 
information retrieval clients such as the browsers by 
Netscape, Microsoft, and Mosaic have been introduced to 
make searching and retrieving information on the network 
more convenient and productive for end-users. 
Unfortunately, the closed architecture of such browsers has 
rendered the software design overloaded and monopolistic. 
Customizing the information retrieval process is difficult if 
not impossible. 

For example, it would be advantageous to enable users to 
retrieve and search through old information — even after the 
original information has evolved or disappeared from the 
original server. Many websites provide timely content such 
as news that changes on a regular basis. A user may wish to 
revisit a web page that the user saw many months earlier, or 
search for information the user recalls seeing on a web page 
that the user visited in the past. Client browsers typically 
only provide a rudimentary save feature and lack basic 
archiving functionality. Any such functionality should be 
provided by some specialized application which must be 
installed or integrated with each client. Otherwise, the user 
must either have the foresight to explicitly save pages he or 
she might be interested in — or must rely on the content 
administrator to provide some archive and search facility. 

SUMMARY OF THE INVENTION 

The present invention presents a system and method of 
providing information retrieved from a server from across a 
communication network that enables archiving services that 
do not interfere with existing components and protocols. The 
services enable users to retrieve and/or search for old 
information, even after such information has evolved or 
disappeared from the original server. The network resource 
naming (e.g. URL) format is extended to include archive 
directives that are intercepted and performed by a proxy 
server. The proxy interprets the archive directive and 
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executes the specified archival command: e.g., adding the 
information to a storage repository, searching and retrieving 
the information from the storage repository, scheduling 
automatic archiving of specified server information, trans- 

S parent archiving of information that is accessed by the client 
or of the client's cache. The information can be easily 
indexed by a timestamp. Multiple proxy servers can col- 
laborate permitting information to be archived in a distrib- 
uted fashion. Embodiments of the present invention advan- 

10 tageously do not require changes to client or server software 
or communication protocols. 

These and other advantages of the invention will be 
apparent to those of ordinary skill in the art by reference to 
the following detailed description and the accompanying 

15 drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 sets forth a diagram of an information access 
2Q system and method illustrating an embodiment of the 
present invention. 

FIG. 2 sets forth an example of a system architecture 
illustrating an embodiment of the present invention. 
FIG. 3 sets forth a list of system commands that can be 
25 invoked using URL extensions. 

FIG. 4 sets forth a screenshot of an example service menu. 
FIG. 5 sets forth a list of commands that can be invoked 
using HTTP extensions. 
30 FIG. 6 sets forth a diagram illustrating the use of filters 
with an information access system and method. 

FIG. 7 A, 7B and 7C set forth examples of cgi-bin filter 
programs. 

FIG. 8 sets forth a list of scripting commands that can be 
35 used with a document pre-processing system. 

FIG. 9 sets forth arguments for use with a javabin class 
invoked by the "block" and "javabin" commands in FIG. 8. 

FIG. 10 sets forth a diagram illustrating the use of 
archiving with an information system and method. 

FIG. 11 sets forth an example of a grep class for use with 
a walking facility. 

FIG. 12 sets forth a screenshot of an example cron menu. 

FIG. 13 sets forth a screenshot of an example archiving 
45 menu, 

FIG. 14 sets forth a diagram illustrating the use of 
multiple archiving repositories. 

FIG. 15 sets forth an example of scripting directives 
embedded in an HTML comment, 

50 

FIG. 16 sets forth a screenshot of an example personalized 
portal. 

FIG. 17 sets forth a screenshot of an example prior art 
portal displaying financial information. 
55 FIG. 18 sets forth a screenshot of an example personalized 
portal. 

DETAILED DESCRIPTION 

FIG. 1 shows a diagram of a communications system 
60 which is suitable to practice the present invention. In the 
exemplary embodiment shown, a proxy server 30 is con- 
nected to a communication network through communication 
link 200 to a gateway 90. The proxy 30 is shown shared by 
a number of clients 60 who are each executing an inform a- 
65 tion retrieval program such as a browser. In an alternate 
embodiment, the proxy and the client browser can be 
executed as processes on the same client machine. Servers 
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70 provide information content to the clients 60 utilizing As an illustration of an example format for a URL 

some document serving protocol, such as the Hypertext extension is: 

Transfer Protocol (HTTP) as described in T. Berners-Lee ct URL?iproxy&command 

al., "Hypertext Transfer Protocol — HTTP/1 .0," RFC 1945, where "?iproxy" is a keyword, and "command" is a service 
Network Working Group, 1996, which is incorporated by 5 that might be applied to a given URL or a system command 
reference herein. As used herein, a document serving pro- that is irrelevant to the given URL. For example, the 
tocol is a communication protocol for the transfer of infor- command could invoke a archiving service (as described in 
mation between a client and a server. In accordance with further detail below in Section 5) or can be used to tailor 
such a protocol, a client 60 requests information from a system functions. The iserver class 2025 parses the corn- 
server 70 by sending a request to the server and the server 10 mands and forwards the command to ihttpd 2026 which, in 
responds to the request by sending a document containing turn, forwards the command to icmd 2028 for processing, 
the requested information to the server. Servers, and the FIG. 3 sets forth a variety of system commands that can be 
information stored therein, can be identified through an defined using URL extensions. 

identification mechanism such as Uniform Resource Loca- In another embodiment of the present invention, a URL 

tors (URL), as described in detail in T Berners-Lee et al., 15 extension can be used to invoke a dynamically generated 

"Uniform Resource Locators," RFC 1738, Network Work- ™ nu of , e f? F™* [™ available services and present it to 

ing Group, 1994, which is incorporated herein by reference. me uscr s b ; ow ^ r ^ r ^ cs ^ ?°< exam P le > te 

, 6 j , ' ... f t 1 - ,1 • . . naming protocol can be extended by introducing a new 

In an advantageous embodiment, the network is the Internet • u 

, . ,, 7 , naming scheme: 

and the servers 70 are Web servers. URL?? 

Proxy server 30, as is known the art, has a processor 140, 20 wfacre ^ doublc . qucstion mark « ?r is a triggcr l0 invokc 

memory 130 and a non-volatile storage device 150 such as a sefvice menu The service menu can ^ imp^med t0 

a disk drive. The memory 130 includes areas for the storage inherit the same cookies as the current URL. The proxy 

of, for example, computer program code and data, as further subsystem intercepts the request from the browser with the 

described below. The proxy server 30 is shown executing above request pattern, generates a menu on-the-fly and 

two processes: an access server 40 and a 105 web server 50, 25 returns the menu to the browser as an HTML file. FIG. 4 

both of which are further described in the pending utility shows an illustrative example of a menu obtained after a user 

patent application, "Information Access System And browses a webpage, here "http://www.interactive.wsj.com/ 

Method," Ser. No. 08/994,600, filed on Dec. 19, 1997, now pages/techmain-html", and types "??" after the URL and 

abandoned,. The access server 40 behaves like a typical presses enter. The menu platform defines entries a number of 

proxy server and accepts document requests (in a preferred 30 services and user-defined macros on a per URL-set basis, 

embodiment, using standard TCP ports and HTTP) and For example, FIG. 4 permits the user to, inter alia, change 

routes them to other proxies or the desired server. The cacne policies, archive or prefetch pages in the html tree, or 

built-in server 50 is designed to act as a function execution search the page hierarchy for a keyword. The menu system 

engine rather than just an information provider. chooses the proper menu description based on the current 

FIG. 2 illustrates an exemplary system architecture, 35 URL. 

adapted for HTTP and implemented as Java classes, pro- The menu service has numerous advantages. It does not 

viding the functionality necessary to practice an embodi- introduce a new graphical user interface, but rather presents 

ment of the present invention. When the proxy application is the menu in HTML content shown in normal browsers. It 

started, the main thread 2010 listens on the proxy port and preserves user's cookies and is easily extensible. New 

receives and responds to HTTP requests from 115 clients 60 40 service entries can be readily plugged into the menu. The 

(browsers or other proxies). The system creates a new User concept of the service menu is akin to pushing special keys 

Thread 2020 to serve each new request. The iserver class during a traditional telephone conversation, for example, 

2025 parses the client's request and forwards commands to "##" — the network then places the line on hold and 

iagent 2035 for remote web access or to ihttpd 2026 for announces a service menu for selection; after menu 

localhost access. The iserver class 2025 also implements 45 selection, the line is placed back to the original conversation, 

various protocol extensions further discussed below in Sec- New commands can also be introduced to HTTP for 

tion 1, The ihttpd class 2026 implements the web server special communications among multiple proxy servers to, 

described above and will return an external file with the hup for example, establish special connections (like persistent 

header or execute a local CGI script to generate the replying channels) and/or perform value-added services (like TCP 

message online. If the content is in a special scripting format 50 forwarding). For example, the commands set forth in FIG. 5 

described in further detail below in Section 3, icmd 2028 can be defined for such communications and used to extend 

will be invoked to parse the script and interpret and execute the conventional HTTP commands, 

any commands embedded in the document page. 2. Filters 

The iagent class 2035 of the agent thread 2030 connects Support is provided for the processing of data by filters, 

to a remote web server 70 or proxy to request a Web page. 55 FIG. 6 illustrates how filters can be applied to http headers, 

iagent 2035 can cache the page and return the page to iserver pages returned from the web, and pages returned from a 

2025 or to ihtwalk 2045, a facility to walk the html tree local cache. For example, an input filter can be used to add 

structure to collect and archive pages. The ihtwalk class new components (menubars, etc.) or modify returned pages 

facility is further described below in Section 4 in the (replace some remote data with local data, etc.). Data from 

description of the walking facilities. 60 servers can also be condensed, compressed, encrypted, 

1 . Protocol Extensions patched, etc., prior to the corresponding filters converting it 

The URL and HTTP protocols can be extended to include back into its original format before returning it to a user, 

additional system commands that advantageously permit The filter functions shown in FIG. 6 can be specified in a 

conventional information retrieval clients to be utilized with configuration file for the proxy's built-in web server. The 

embodiments of the present invention. This can be accom- 65 function can be specified with a corresponding URL pattern, 

plished in a manner that is transparent to the client browser such that the filter is applied to URLs matching the pattern, 

application and, thus, requires no new user interface. For example, the following entries: 
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InputFilter /bin/fpack.cgi http://Vresume.html invoked one by one to perform tasks on a cache of the page. 

OutputFilter /bin/funpack.cgi http://Vresume.html Examples of such functions include functions for archiving 

HeaderFilter /bin/forward.cgi http://www.att.com/* J^ cb pagps > for kevwords > and creating index 

specify cgi functions for mc indicated filters and URL * ^ nG 1() for ^ ^ 

patterns. As indicated in the HeaderFilter entry, all HTTP ^ {w ^ % desigQe d in accordance with a preferred embodi^ 

calls for the web server www.att.com will be processed by mem of the present invemion . ^ £ for me walki 

the cgi function forward.cgi before being sent out to the faction is given in FIG. 3. For example, for the URL 

remote web server (or proxy server), and so on. Filters can " http://www.att.com/?iproxy&htwalk-3,-local,-image ) 

also be specified using an extended URL command. For arcbive,grep=Cable 

example, the system can be configured to calls filters with 10 the system walks down 3 levels for pages referred (directly 

the arguments: or indirectly) by www.att.com, including image files, but 

http://localhost/javabin/iaputfilter?url=cached_ only for those pages on the local server. For each visited 

url&path=cache_file page, the system calls the function archive to archive the 

http://localhost/javabin/outputfilter?url=cached_ 15 P a S e and the Action grep to search for the keyword 

url&path=cache_file "Cable" in the page. For each visited page, the system calls 

http://localhost/javabin/headerfilter?ht.headeroheader_ the followin g Cgi-bin P ro 8 ra ™ one-by-one: 

lj nes /bin/archive. cgi?url=visited_page&args=int_no,- 

Filter programs can be implemented as cgi-bin programs local&path-cached_file 

and inherit the cgi-bin API. Every filter programs imple- 2 o &htwalk=archive 

mented as a Java class will create an instance when invoked. /bin/grep.cgi?url=visited_page&args=int_no f - 

F1GS. 7A, 7B, and 7C show example cgi-bin filter programs, local&path=cached_file 

FIG. 7A shows an inputfilter; the Execute method reading &htwalk=grep&walkopt=Cache 

the "cache _file" and updating the file if necessary. In FIG. FIG - 11 sets forlh an example of a grep class used to 

7B, outputfilter uses "cache_file" as input and sends the 2 5 ira P lemenl tne walking function. 

result to OutputStream which will be sent back to the 5 - Archiving Service 

original caller. FIG. 7C shows headerfilter which gets the ^ URLs processed by the proxy 30 can be extended to 

original http header (multiple lines without the empty line) include archive directives that add data to a storage 

from Hashtable, and outputs a new header to OutputStream. repository, e.g. device 150, and for retrieving the archived 

3. Scripting Facilities 30 data. T° e aew commands are intercepted and performed by 
In addition to Java class invocation mechanisms and the P rox y server Because a proxy is used as a middleman 

standard scripting mechanisms such as Ksh and Perl, it is between the browser and the web servers, the new archiving 

advantageous to provide support for document pre- services are just plug-in components and do not interfere 

processor scripting. Proxy-side scripting allows the different with existing components and protocols, 

system components to be integrated in a light-weight man- 35 As an advantageous example, the URL naming scheme 

ner that has a syntax and a semantics that is very HTML- can be extended to include URLs in the following format: 

like. Scripting provides a method for plugged-in services/ http://view@host/path 

functions to access built-in functions, server status and data where- the "view" is a date, for example in the format 

structure, and the cache. Ascripting proxy pre-processor can yyyymmdd, when the corresponding page (i.e., http://host/ 

also provide a communication media for multiple proxy 40 path) has been retrieved from the original web server and 

servers. stored into the archive repository. The timestamp can be 

The inventors devised a scripting language providing used as the key to' locate- the page from the repository. For 

extra macros statements based on the standard document example, 

markup language HTML. The proxy server pre-processes http ://l 9 98070 l@www.att. com/would refer to the page 

the scripts and turns the comments into pure HTML. In a 45 http://www.att.com that was retrieved and archived from 

preferred embodiment of such scripting, it should include the www.att.com web server on Jul. 1, 1998. 

support for variable declarations, conditional statements, More advanced features can be implemented to' allow 

sets of built-in functions, and interfaces to invoke cgi-bin. specifying an action 'in front of the date for locating an 

The script can be a standard HTML-like plain text file that alternative page on the archive seWer if the dated page does 

contains the macro statements. In order to identify the 50 not exist. The system jirst locates the page' archived on that 

document as a script, an identifier should be included; for date r If the desired page is not found in the repository, the 

example, the first line in the script can start with: system thdn searches for the page in the repository before or 

#!/iproxy/script after the date and returns the first found page. For example, 

The arguments of a cgi-bin can be accessed using "${arg}" the following syntax for naming archived pages can be used: 

inside the script file. FIG. 8 sets forth a list of built-in 55 http://[+|-]yyyymmdd[.hhmmss]@host/path 

statements that are useful for scripting. When using the Thus, http://+date@h'ost/path would choose the first page 

block*' and ":javabin" commands, the script will invoke a found after the specified date; on the other hand, http://- 

javabin class with the arguments set forth in FIG. 9. date@host/path points to the first page before the date. For 

4. Walking Facilities example, http ://+ 1998070 l@www.att.com points to the first 
One useful facility is a basic function supporting a mecha- 60 page that was archived on Jul. 1, 1998 or after, while 

nism to walk through document page hierarchies. This is http:/11998b701@www.alt.com is for the page archived on 

similar to what "find" and "tw" (file tree walk) do on Unix Jul. 1, 1998, or before. As shown in the syntax, the granu- 

file systems. The walking action is specified by a root URL larity of "view" can be readily taken down to the second 

where the action starts, a specification of how many levels level of hours, minutes, and seconds, 

the action will visit, and certain additional properties such as 65 The above naming scheme advantageously is compatible 

whether or not image files should be included. For each with the conventional URL protocol, which defines network 

visited page, one or more of a list of functions can be resource naming as 
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proto://[user[:password]@]host/path. For example, the Thus, multiple repositories may be easily addressed and 

URL accessed to quickly locate the desired information. 

ftp://foo:bar@ftp.research.att.com/README points to a 6 * Packaging Service 

README page on (he FTP server ftp.research.att.com, t A packaging service can be provided which would allow 

while accessing on behalf of the user foo with the 5 ™ browsing of document pages from portable packages, 

password bar. The user/password portion of the URL m P adca 8 e *P>**b\t in that it can be possible to attach 

protocol is undefined in the HTTP protocol; ™ * " I*'™* or to ^ from one machme to another 

j- i .u u u .i-i To create a package, the walking facility may be used to visit 

accordingly, me aoove nammg scneme can taKe aavan- ^ ^ of web ^ 

are designated to be packed. Thus, 

tage of this for archive naming. the system would accept (a) a root URL, (b) a parameter 

Multiple methods can be used to invoke the archive io expressing the depth of the walking through the root URL's 

service and store data in the repository: direct and indirect references, (c) an image option to decide 

Command Extensions. Using any of the methods whether or not to include the image files when walking 

described above under Section 1, a user can invoke the through the web pages, (d) a reference filter option (for 

archiving service. For example, the following URL: accessing all references or filtering out some of them), (e) 

http://www.research.att.com/ 15 mechanisms for storing cookies needed for accessing the 

iPROXY.html?iproxy&action=archive root URL; and (f) a package name. The system then walks 

can be used to cause the system to archive the page through the set of web pages rooted by the designated URL 

http://www.research.att.com/iPROXY.html. ™ d P ac ^ them into a P ac ^§f using the designated name. 

o u j i -a a u* • m. . ii- liic package can use any ot the number or known conven- 

Scheduled Archiving. The system can also be extended to nn 4 . K B . c J t . 7 j .7™ wuuv *" 

^_ — ^ — : — ^ 1-— .--, , j — 20 tional compression formats to package the data. The package 

contain a server- hat executes an archiving task at a desig- fa advuta ^ ousl self-contained, including two physical 

nated time, e.g. like cron command in Unix systems. The files: onfi fof ^ anQther fof ' Each 

walking facilities described above can be used to walk* k can maintain [{s QWn index tab]e 

through a web site for a set of HT^L pages. As described , Q Qrder {Q brows£ ^ k ^ can 

above, the waUcing can be defined by a root URL and 25 support t0 acccss the wcb storcd in , hc 

parameterized by (a) the depth of walking through hyper- kage ^ . f lh were from ^ orf ^ web si(es ^ 

references under HTML pages, (b) with or without image packagcs arc stofcd Qn a ^ ^ flnd brQWsi does nQt 

files embedded in HTML pages and (c) walking through ^ a Qetwork connectioa browsing) me server 

pages on the local web site or on all web sites. FIG. 12 shows ^ {hc above . mentioned indcx table l0 locate corrcspond . 

an example page interface for a cron cgi program which can 3Q { content - n the acka ^ ^ ales the 

be used to add a cron job to archive a web site. Users can pack may be from ^ Qne ^ browscs thc 

schedule an archive task on a daily or weekly basis^ k ^ mMes used to the web ^ 

Transparent Archiving Tlie system can support a finction automatically handled by the s stem 

of archiving selected web pages whenever they have been New documents can easil be appended to an existing 

accessed automatically The specification can be done 35 package. A single package may, in fact, contain more than 

through the CacheFlag of a cache command, which specifies on£ r0Qt URL £ach f00t URL in ^ package caQ be chosen 

the caching policy: as an entry po[nt tQ brQWSe the package 

CacheFlag Archive URL-expression 7 . Personalized Services 

Whenever a requested URL matches the URI^expression, r t j s notable that the proxy system described above can 

the system puts the data in the archive repository. 40 have access to a user's private information such as the web 

Archiving Browser Cache. The system can support a access n j story and sensitive financial information. A user 

function for scanning data cached on a browser's cache area may not be comfortable with providing this information to 

and archiving them into the repository. a server across ^ communication network. The same 

As for retrieving information from the repository, FIG. 13 information can be stored in a safe location locally, on the 

shows how an example screenshot of an interface that can be 45 cueQ t machine where the proxy has been integrated on the 

used to browse the archived information. Each month's data client-side or on a local centralized proxy closer to the user, 

of each website can be stored in a repository structure The present invention permits several new personalized 

similar to a Unix-like pax archive. A cgi program creates a services to be provided. 

page on-the-fly based on the contents of the archive and The proxy can be used to integrate in an automatic manner 

creates hyperlinks for each website name. The hyperlinks 50 the information provided by a typical portal with sensitive 

are listed for each month when the certain pages of the personal information. For example, a user issues a request 

website were archived. For example, the AT&T website has f or m e following URL: 

a cron job to archive the web pages on a daily basis, so the http://www.att.nety?iproxy&actionoportal 

data in FIG. 13 has all twelve months in 1998 listed as ^ portal command W01lld cause the proxy t0 

hyperlinks. „ « retrieve the home page from www.att.net, which has been 

Multiple archive repositories can take advantage of the encoded ^ scripting d i rec tives that instruct the proxy 

above-mentioned features for inter-proxy communication to server how l0 process the local data and e u wilh lhe 

share the burden of storing the information. For example, scrvcr C0Tl[cnts . It then presents the personal portal page 

FIG. 14 shows how multiple archive repositories can col- back t0 the mCT In order to ide that is 

laborate in the archiving task. Proxy server in FIG. 14 60 non . in trusive to other users who are not using the present 

queries another repository in order to answer a user request. mven tion, the scripting directives can be embedded in 

As shown m FIG 14, the following advantageous web HTML comments as shown in FIG. 15. Tlie proxy server 

interface can be used to search the repository or repositories: intercepts the directives in FIG. 15 and performs the nec- 

http://proxyhost/bin/iget/view@host/path esS ary actions before returning the server portal page to the 

A repository search path variable can be set using a com- 65 browser. All the directives, being embedded in an HTML 

mand such as: comment, are ignored by browsers not using a proxy server 

RepositoryPath-Host:Port[;Host:Port]* configured in accordance with the present invention. 
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The following are some illustrative examples of how to 
use the capabilities of personalizing information retrieved 
from the public servers. They are merely examples and are 
not meant to limit the nature of the present invention: 

Personal Web Page Reminders and Hot Sites. Since the 
proxy server can log a user's web accesses, it can analyze the 
log and use the data to provide new web services that can 
improve the user's browsing experience. FIG. 16 shows, for 
example, two possible services: (1) "TO -READ" homep- 
ages. Auser can specify a list of websites and corresponding 
frequencies when certain websites should be visited. The 
proxy can check the last visiting dates and schedule a list of 
pages that the user should visit today. (2) "HOT Sites." The 
proxy server can compute the number of visits to each 
website and list the top ten websites with their last visiting 
dates accordingly as soon as the user accesses his/her 
personal portal. There are several advantages of having such 
a list. The user may want to know the last visiting time of a 
favorite site — the timestamp can be displayed along with the 
website link. The user may wish to access the latest version 
by clicking on the link. The user may wish to compare the 
new version with the old using some differencing tool, 
especially since the previous versions of the web page can 
be archived as described above. FIG. 15 shows the directives 
used to construct the page displayed in FIG. 16. The 
directive "to- read" constructs a list of web pages scheduled 
to be read; the directive "dolog" analyzes the current web 
access log to produce the statistics need for the next 
directive, "top 10" which presents the results on the personal 
portal. 

Personalized Financial Page. Most portals currently allow 
users to specify the stocks that they are interested in and 
display the latest stock price when a user accesses the 
personalized page. See FIG. 17 which shows a typical AT&T 
WorldNet portal showing various stock quotes. However, 
the portal cannot compute your current balance or net 
gain/loss unless you provide private and sensitive informa- 
tion like how many shares you own and when you purchased 
them. Most users would not like to provide such information 
to their portal page server. In accordance with an embodi- 
ment of the present invention, such personalized information 
such as real purchase price, commission fees, and the 
number of shares of each stock can be stored and accessed 
by the proxy server. By constructing an output filter for the 
stock page, the proxy server can retrieve the private 
information, combine it with stock quotes provided by the 
portal site to compute the balance, net gain/loss, and other 
interesting personalized information/statistics. FIG. 18 
shows the same view as FIG. 17 personalized with the user's 
information. This can be accomplished by a specification 
like the following in the proxy configuration file: 

OutputFilter/bin/portfolio.cgihttp:// 
stocks.planetdirect.com/portfolio.asp 
This instructs the system to apply the Java class "portfolio" 
as an output filter whenever the browser issues the corre- 
sponding http request. The numbers in FIG. 18 are visible 
only to the client and not the original portal server. The user 
is shown as having bought 267 AT&T shares, 50 Netscape 
shares, and 40 E*Trade shares at the respective prices of 
$36.50, $21.0, and $39.38, each. The commissions were 0, 
$19.95, and $19.95. The total gain was $17,799.02. The 
numbers replaced by the proxy server are shown in a 
different shade. 

Personal Web Archive. While current search engines 
allow users to find pages of a certain topic easily, they do not 
offer much help in looking and viewing the pages a user has 
seen in the past, except for those that are still kept in the 
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browser cache. Due to the sharp decrease in storage costs, a 
client-side proxy can afford to archive all the web pages a 
user has seen so that any of these pages can be retrieved 
easily later on — without even bookmarking them. Existing 

s wcbpage search tools such as Alta Vista Discovery can be 
used to index the web archive and search the pages. A user 
can then quickly conduct a search of all pages he/she has 
seen in the last year, for example. As described in the 
archiving section above, the instant system can intercept http 

10 requests and effectively extend the URL name space to 
address pages stored in the archive by adding a timestamp in 
front of the regular http address. Even as the web pages go 
through major redesigns, the original pages can be accessed 
using the archive extensions to give the same content. 

15 The foregoing Detailed Description is to be understood as 
being in every respect illustrative and exemplary, but not 
restrictive, and the scope of the invention disclosed herein is 
not to be determined from the Detailed Description, but 
rather from the claims as interpreted according to the full 

20 breadth permitted by the patent laws. It is to be understood 
that the embodiments shown and described herein are only 
illustrative of the principles of the present invention and that 
various modifications may be implemented by those skilled 
in the art without departing from the scope and spirit of the 

2j invention. For example, the detailed description has been 
described with particular emphasis on the Internet standards 
of HTFP, URL's, and HTML. However, the principles of the 
present invention could be extended to other protocols for 
serving information from a server to a client. Such an 

30 extension could be readily implemented by one of ordinary 
skill in the art given the above disclosure. 
What is claimed is: 

1. A proxy server interposed between at least one client 
browser and a server, the proxy server comprising: 
35 a first interface for establishing a document serving pro- 
tocol channel to the server; 
a second interface for establishing a second document 

serving protocol channel to the client browser; 
a storage device for archiving documents; 
40 a processor adapted to 

(i) receive a first requests from the client browser for 
one or more documents on the server, 

(ii) retrieve the documents from the server, 

(iii) store the documents on the storage device, 

45 (iv) associate the stored documents with date informa- 
tion indicating when the documents were retrieved 
from the server, 

(v) sending the documents to the client browser, 

(vi) receive at a later time a second request from the 
50 client browser wherein the request contains an 

archive directive, 

(vii) parse the second requests for an archive directive, 
and, 

(viii) where the requests contain an archive directive 
55 requesting an archived copy of the one or more 

documents and specifying an archival date or a range 
of archival dates, 

(a) access the storage devices, 

(b) search for date information associated with docu- 
60 ments stored on the storage device that match the 

archival date specified in the request, 

(c) if there is matching date information, retrieving 
the documents from the storage device associated 
with the date information and sending the 

65 retrieved documents to the client browser, and 

(d) if there is more than one matching date 
information, sending a list of archived documents 
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with associated matching date information to the 
client browser. 

2. The proxy server of claim 1 wherein the documents 
stored on the storage device are stored as a set in a portable 
package on the storage device and are sent to the client 5 
browser as a set in the portable package when requested by 
the client browser in the archive directive in the request. 

3. The proxy server of claim 1 wherein the archive 
directive is expressed as a uniform resource locator with a 
format of date@bost/path. 10 

4. The proxy server of claim 1 wherein the document 
serving protocol is the hypertext transfer protocol. 

5. The proxy server of claim 1 wherein the document is a 
web page. 

6. A computer readable medium containing executable 15 
program instructions for performing a method on a proxy 
server interposed between at least one client browser and a 
server comprising the steps of: 

receiving a first requests from the client browser for one 
or more documents on the server, 20 

retrieve the documents from the server, 

store the documents on a storage device, 

associate the stored documents with date information 
indicating when the documents were retrieved from the 2 5 
server, 

sending the documents to the client browser, 

receive at a later time a second request from the client 

browser wherein the request contains an archive 

directive, 
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parsing the second requests for an archive directive, and, 
where the requests contain an archive directive requesting 
an archived copy of the one or more documents and 
specifying an archival date or a range of archival dates, 
accessing the storage device, 

search for date information associated with documents 
stored on the storage device that match the archival 
date specified in the request, 

if there is matching date information, retrieving the 
documents from the storage device associated with 
the date information and sending the retrieved docu- 
ments to the client browser, and 

if there is more than one matching date information, 
sending a list of archived documents with associated 
matching date information to the client browser. 

7. The computer readable medium of claim 6 wherein the 
documents stored on the storage device are stored as a set in 
a portable package on the storage device and are sent to the 
client browser as a set in the portable package when 
requested by the client browser in the archive directive in the 
request. 

8. The computer readable medium of claim 6 wherein the 
archive directive is expressed as a uniform resource locator 
with a format of date@host/path. 

9. The computer readable medium of claim 6 wherein the 
document serving protocol is the hypertext transfer protocol. 

10. The computer readable medium of claim 6 wherein 
the document is a web page. 

***** 
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METHOD AND APPARATUS FOR 
TRACKING CLIENT INTERACTION WITH A 
NETWORK RESOURCE AND CREATING 
CLIENT PROFILES AND RESOURCE 
DATABASE 

RELATED APPLICATIONS 

This application is a continuation-in-part of application 
Ser. No. 09/120,376, titled "METHOD AND APPARATUS 
FOR TRACKING CLIENT INTERACTION WITH A NET- 
WORK RESOURCE AND CREATING CLIENT PRO- 
FILES AND RESOURCE DATABASE", filed Jul. 21, 1998, 
now U.S. Pat No. 6,138,155, which is a continuation of 
application Ser. No. 08/821,534, titled "METHOD AND 
APPARATUS FOR TRACKING CLIENT INTERACTION 
WITH A NETWORK RESOURCE AND CREATING CLI- 
ENT PROFILES AND RESOURCE DATABASE", filed 
Mar. 21, 1997, issued Aug. 18, 1998, as U.S. Pat. No. 
5,796,952, the disclosures of which are herein incorporated 
by reference. 

COPYRIGHT NOTICE 

A portion of the disclosure of this patent document 
contains material which is subject to copyright protection. 
The copyright owner has no objection to the facsimile 
reproduction by anyone of the patent document or the patent 
disclosure, as it appears in the Patent and Trademark Office 
patent files or records, but otherwise reserves all copyright 
rights whatsoever. 

FIELD OF THE INVENTION 

The present invention relates to a method and apparatus 
for monitoring client use of and interaction with a resource 
downloaded from a server on a computer network, for 
storing monitored data, for creating a database including 
profiles indexed by user and/or resource identity, and for 
generating customized resources based upon client profiles. 

BACKGROUND OF THE INVENTION 

The development of software packages designed to permit 
simplified graphical user interface (GUI)-based access to the 
wealth of electronic information available over the Internet 
and the World Wide Web has lead to a dramatic increase in 
the amount of information that is currently available over 
public computer networks. Unlike the highly controlled 
atmosphere of a private computer network, however, it is 
difficult to monitor user interaction with network resources 
on public networks. As a result, it is difficult for individual 
servers on a public network to determine how long indi- 
vidual users have interacted with their resources, or how 
much information has been downloaded. It is equally diffi- 
cult for individual servers to target specialized information 
to a particular audience or to learn the identity of individual 
users on a public network. 

The techniques utilized in many private networks for 
monitoring client use and interaction do not lend themselves 
to public networks. For example, user access to a server in 
private networks is generally obtained through the use of a 
unique identification number provided by the server. Details 
of individual user interaction with the network are closely 
monitored by server-resident processes, and historic data- 
bases are automatically generated and continually updated to 
track the nature and amount of information accessed by 
individual users, as well as their connection time. This 
information is generally used, for example, to maintain a 
subscriber-indexed billing database. 
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In a public computer network, however, use of server- 
resident monitoring techniques may be severely limited. In 
some public networks, subscribers are given unlimited 
access, via a service provider, to a virtually unlimited 

s number of servers, and no permanent connection is usually 
made between these servers and a client machine. The nature 
and amount of information downloaded by individual users 
is not easily monitored for each client machine and only 
limited information concerning individual user interaction 

10 with the network may generally be captured by a server (i.e., 
so-called network ID and client ID). 

Due largely to the lack of advanced monitoring tech- 
niques available to individual servers on a public network, 
the same information is generally served out to all clients on 

!5 a completely untargeted basis. In other words, the same 
information is generally downloaded to all users that access 
a particular resource on a server, irrespective of individual 
user interests. There is therefore a need to provide servers on 
a public network with the ability to automatically monitor 

20 use of and interaction with resources downloaded by users 
so as to facilitate the targeted serving of information. 

While various methods are known for obtaining informa- 
tion concerning user preferences, no such methods are 
automatic. For instance, one application, known as a "cus- 

25 tomizable home page", permits users, upon the request of a 
server, to make certain choices. When a user who has done 
so contacts that server at a later date, the server assembles 
information for downloading to the user in accordance with 
the previously -selected choices. More specifically, the user 

30 visits a so-called "Web page" of a particular server where he 
or she is asked to fill in a blank form by selecting various 
preferences, such as links to favorite Web sites, interests in 
entertainment, sports, and the like. The user then submits 
this information to the server by clicking the so-called 

35 "submit" button of the fill-in form, which causes the client 
to transmit the information to the server. The server returns 
a Web page with a response header which creates, or "sets" 
an ID field located in a file on the client computer (this file 
is known as the "client ID" or "cookie") to include infor- 

40 mation about the user's preferences. When the user later 
returns to a specified Uniform Resource Locator, or "URL", 
on the same server, the "client ID" or "cookie" with the 
previously^set preference information is transmitted in the 
HTTP request header to the server, which can then return a 

45 Web page that is assembled according to the user-specific 
information. This application is disclosed, for example, in A 
Gundavaram, CGI Programming on the World Wide Web, 
O'Reilly Press, 1996. 
While the "customizable home page" facilitates the serv- 

50 icg of information on a limited targeted basis, it does not 
provide for the automatic determination of user interests, 
and inconveniences the user by requesting that he or she 
specify various preferences. Moreover, use of a customiz- 
able home page is limited to individual Web sites and can not 

55 be "spread out" over multiple resources on different servers. 
In other words, while a customizable home page may be of 
use with respect to the particular resources located on a 
single server, it does not serve any purpose for other servers 
on a public network. A variation of this technique is used by 

60 some servers to download executable programs. For 
instance, one such application disclosed by G. Cornell and 
C. S. Horstmann, in Core Java, The SunSoft Press, 1996, 
involves the generation of "order forms" on client comput- 
ers. In this application, the client machine loads a Web page 

65 from a server which has an embedded link to an executable 
program that downloads to and executes on the client 
machine. Upon execution in the client machine, the program 
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contacts the server and retrieves a list of goods and associ- time, a counter representing the number of times the specific 

ated prices. The program allows the user to order various ad has been displayed is incremented. The ad banner itself 

goods and requires the user to fill out a form for billing may have an embedded address referring to yet another Web 

purposes. The user "clicks" on the submit button of the fill-in resource. In such an instance, if the user "clicks" on the ad 

form to transmit the information to the server. Like the $ banner, the client may load a resource on the second server 

customizable home page, this method of user-specific data which oncc again capturcs fa c uscr * s nj and forwards the 

acquisition requires the active participation of the user, and user t0 a We5 resource & a p pr0 pri ate for the displayed 

does not provide for the automatic determination of user ad (for cxamplc> a page 0D me advertiser's Web site). At the 

preferences and interests. same tmiej a coun ( er representing the number of times the 

In addition to the inability to serve out information on a 10 specific ad was clicked on to go to the advertiser's Web site 

targeted basis, which is of enormous concern from a mar- ^ incremented. 

keting standpoint, the limited monitoring capabilities avail- While Web sites and ad banners havCj - n ^ c ^ been 

able to individual servers makes it difficult for servers and valuaMe markcting {ooK thc limited monitoring capabilities 

administrators to determine how long users have viewed avaUable tQ Qn networks m which n0 permaneD t 

their resources and how much information has been down- 35 connection is made between a server and a client (such as the 

loaded by individual users so as to be able to bill client use In(ernel) has prevented these marketmg {oo]s from being 

and interaction with network resources and to analyze the used tQ thdr fiU ^nlM. Since HTTP or Web servers 

value and effectiveness of such resources. As a result, much cannot wtomatically determine the amount of time and the 

of the information provided by a server over a public frequency at which particular users interact with their 

network is the same for all clients. In addition, while it is 20 resourceSj Web site administrators and advertisers cannot 

currently possible to track a user's links within the same accuralely determine the effectiveness of their resources, 

resource, there is no standard way to track user's links across Smcc cannot automatically monitor user interaction 

multiple resources on different servers. For example, a and automatically obtain ^ preferences and interests, 

common occurrence in public networks is when a user is scrvcrs cannot asscmble and scrvc rcsource s targeted to 

viewing a first resource and "clicks on" a link to a second 25 i^vi^ user mter ests. 
resource located on a different server. In such instances, the 

second resource is downloaded and the first resource is BRIEF SUMMARY OF THE INVENTION 

either discarded or held in background. However, there is T . ri , r . , . ri , ^ 

. r . . « « In view of the foregoing shortcomings of the prior art, an 

generally no uniform way in which to monitor such occur- . r t , * • * . -j lL Jr 

; .j-,- ■„ • *i • 1 j - 1 *i_ object 01 the present invention is to provide a method for 

rences. In addition, while it is currently possible to track the m * 1 • *i_ j • . r 

, r A . ' • ! i_ i_ 1 > . tracking the use and interaction of a user with a resource 

number of times a particular resource has been accessed, it j 1 j j r , 1 l r . 1 ■ 

. tl . , -ui . . 1 .u 1 downloaded from a server on a network by use of a tracking 

has generally not been possible to track the length of Ume a L jj j • iL J . . • . 

4 r , ' . . • j. ^1 program embedded in the resource and executable by a 

particular resource has been viewed by a particular user. V * A *i_ i_- . . ■ • . * 

, 1 1 /j 1 1 1. 1 • r *■ client. Another object of the present invention is to transmit 

There is also a great deal of other valuable information .1 , <• ■ • * *• c i- * * *i_ 

„. , u , the tracking information from a client to another computer 

concerning user interaction with a resource which would be 35 „ , 7 , c , , r 

, . , & , . . ± . , , 4 . c . connected to the network for storage and analysis, 

useful to administrators, advertisers, marketing profession- * 

als and the like, but which can not be conveniently collected J ob J ect of *e present invention is to create a 

using current monitoring techniques. database of server including, but not limited to, the 

~ , f »u 1 ( ur h , number of times a resource has been displayed by clients, 

For example, one of the largest public networks, the - - . , , , . r j j » 

« T . .« u u ♦ 1 1 1 j * • the amount of time displayed, and the type and amount of 

"Internet , has become an extremely popular advertising 40 . - ... , 3 , . c J j ™ . . - 

, , . ... t 4 * «x\t u •* « information that was displayed or transferred. This mforma- 

tool. Many companies have their own Internet Web sites . tJ L , t *\ 7 , , . . 

. . 1 • ij • • . tion could be used by network administrators or servers to 

and have also purchased advertising space within more , ' «. .. y c ' ' /? 

\i/oK c,*L «f „ii, 0 r ^™„o„; 0 t cv,r ^, nt , analyze the effectiveness of the resources made available on 

popular Web sites or other companies, ror instance, many . J 

advertisers purchase so-called "advertising banner" (or "ad meir nctwork se rv c rs. 

banner") space within the Web page of a popular site, 45 Yet another ob J ect of the P resent mvention 13 to P rovide 
thereby allowing consumers to "click-through" (i.e., specify means for creating a database of user profiles for use by 
a link) to the Web site of the advertiser. In many cases, the advertisers and/or marketers to determine the effectiveness 
use of an ad banner substantially increases the advertiser's and vahie of network-based advertisements and/or market- 
exposure. Using the limited monitoring techniques available m S resources. 

to Internet servers, however, it is difficult to determine the 50 Sti11 yet another object of the present invention is to 

effectiveness of individual Web sites and ad banners. For provide means for creating a database of user profiles 

instance, known monitoring techniques are generally limited containing details of individual user interaction with and use 

to determining the number of times a Web page was down- °f network resources including, for example, Network IDs 

loaded. Similar techniques are used to determine the number (known as "IP address") and client IDs (known as 

of times an ad banner (which is embedded inside a Web 55 "cookies") that have accessed particular resources, the 

page) has been displayed, and how many times the banner amount of time spent by users interacting with and/or using 

was "clicked" on to visit the Web site of the advertiser. particular resources, and details of choices created by indi- 

Generally, an ad banner is embedded inside a Web page vidual users a particular resource, 

located on a first server through the use of the known It is still yet another object of the present invention to 

HTML<IMG> tag. When a client machine passes a TCP/IP 60 provide means for assembling a resource, such as a Web 

request for the Web page to the first server, the Web page is page or a highly targeted ad banner, in accordance with a 

downloaded to the client, including the ad banner embedded historic user profile. 

using the <IMG> tag. The <IMG> tag is used to reference In order to achieve the above-described and other objects 

a resource (i.e., the "ad banner") stored on the same or a and advantages, a tracking program is embedded in a file 

different server which captures the user's ID (via the HTTP 65 which is downloaded from a server to a client. The tracking 

request header) and dynamically returns an ad related image program need not originate from the same server that sent 

to the client for display within the Web page. At the same the file, and may be obtained, for example, via an embedded 
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URL that points to a different server. The tracking program 
may be part of a larger program that performs other opera- 
tions (such as displaying animations, playing sounds, etc.). 
The tracking program is downloaded from a server and runs 
on the client to monitor various indicia, such as elapsed time, 
mouse events, keyboard events, and the like, in order to 
track the user's interaction with and use of the file or to 
monitor choices (such as selections or links to other 
resources or files) made by the user while within the file. The 
tracking program may also monitor the amount of data 
downloaded by the client Operation of the tracking program 
commences after the program is downloaded and any 
required initialization occurs. 

After monitoring the user's interaction with and use of the 
file downloaded from the server, the tracking program then 
automatically sends the information acquired from the client 
back to a server for storage and analysis. The information 
may be sent before or as the client exits the file, or may be 
sent in response to a predetermined user action. The infor- 
mation preferably includes any available client or network 
IDs. 

The acquired information is preferably stored on a server 
and used to build historical profiles of individual users, to 
serve out highly targeted information based upon user 
profiles, as well as to extract information about how much 
data was downloaded by a respective client, and how long 
or how often specific files were displayed or in use by the 
client. 

Preferably, the tracking program is implemented in a 
network based upon the client/server model, and may be 
implemented in a public network such as the Internet or 
World Wide Web. The tracking program may monitor use of 
and interaction with any of the resources downloaded from 
a server, including an executable program, a database file, an 
interactive game, a multimedia application, and the like. In 
the case of the Internet, for example, the tracked resource 
may, for example, be a file such as a Web page or part of a 
Web page (such as an ad banner). 

In one embodiment of the present invention, the tracking 
program is embedded in an HTML document (such as a Web 
site, a Web page, or part of a Web page — e.g. an "ad 
banner"). A TCP/IP connection is used by a client to pass a 
request for the HTML document. The HTML document is 
stored in a server running an HTTP service and contains text 
and one or more first embedded URLs for pointing to one or 
more graphical images located on a server, the images being 
embedded inside the HTML document using an 
HTML<IMG> tag to specify the source URL for an image. 
The HTML document also contains a second embedded 
URL for pointing to a first executable program that runs on 
a server, the first executable program being embedded inside 
the HTML document using an HTML<IMG> tag to specify 
the source URL for the program. A second executable 
program is also embedded in the HTML document by using 
a third URL for pointing to the second executable program. 
Unlike the first executable program, the second executable 
program is downloaded and runs on the client. The second 
executable program is embedded using the proper HTML 
tag to indicate that it is a program that is executable on the 
client. 

After the HTML document is downloaded to the client, 
the graphical images are fetched using a TCP/IP connection 
to server resources specified by the one or more first URLs. 
In attempting to fetch the resource associated with the first 
executable program, the client causes the program to run on 
the server specified by the second URL. Upon execution of 
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the first executable program, the server captures identifying 
indicia from the client, such as any network or client IDs 
resident in the HTTP request header sent by the client. The 
server stores this information in a client profile database. 

5 The client also fetches the second executable program, 
which is the tracking program. The tracking program down- 
loads to the client, and, after performing any required 
initialization, determines the current time. The tracking 
program also determines the current time upon the perfor- 

10 mance of a predetermined operation on the client computer 
by a user, such as leaving the HTML document. After 
calculating the amount of time the user interacted with and 
displayed the HTML document, i.e., by determining the 
difference in time values, the tracking program uploads the 

15 calculated value to the server for storage in the user profile 
database. 

BRIEF DESCRIPTION OF THE DRAWINGS 

2Q FIG. 1 is a diagram of a computer network in which the 
present invention may be implemented; 

FIG. 2 is a block diagram of a client computer which is 
used in connection with various preferred embodiments of 
the present invention; 

25 FIG. 3 is a flowchart diagram of a first embodiment of the 
present invention, which is a method for monitoring the 
amount of time a Web page is displayed on a client com- 
puter; 

FIG. 4 is a flowchart diagram of a second embodiment of 
30 the present invention, which is a method for monitoring the 
amount of time a Web page is displayed on a client com- 
puter; 

FIG. 5 is a flowchart diagram of a third embodiment of the 
35 present invention; 

FIG. 6 is a flowchart diagram of a fourth embodiment of 
the present invention; 

FIG. 7 is a flowchart diagram of a fifth embodiment of the 
present invention; 
40 FIG. 8 is a flowchart diagram of a sixth embodiment of the 
present invention; 

FIG. 9 is a flowchart diagram of a seventh embodiment of 
the present invention; 

FIG. 10 is a flowchart diagram of a eighth embodiment of 
the present invention; and 

FIG. 11 is a flowchart diagram of a ninth embodiment of 
the present invention. 

DETAILED DESCRIPTION OF THE 
50 PREFERRED EMBODIMENTS 

The teachings of the present invention are applicable to 
many different types of computer networks and may also be 
used, for instance, in conjunction with direct on-line con- 

55 nections to databases. As will be appreciated by those of 
ordinary skill in the art, while the following discussion sets 
forth various preferred implementations of the method and 
system of the present invention, these implementations are 
not intended to be restrictive of the appended claims, nor are 

60 they intended to imply that the claimed invention has limited 
applicability to one type of computer network. In this regard, 
the teachings of the present invention are equally applicable 
for use in local area networks of all types, wide area 
networks, private networks, on-line subscription services, 

65 on-line database services, private networks, and public net- 
works including the Internet and the World Wide Web. 
While the principles underlying the Internet and the World 
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Wide Web are described in some detail hereinbelow in 
connection with various aspects of the present invention, 
this discussion is provided for descriptive purposes only and 
is not intended to imply any limiting aspects to the broadly 
claimed methods and systems of the present invention. 5 

The present invention, although equally applicable to 
public and private computer networks, is particularly useful 
for performing monitoring functions in connection with 
public networks which could not heretofore be performed. 
For this reason, implementation of the present invention will 10 
be discussed in detail in connection with the Internet and the 
World Wide Web. This discussion is equally applicable to 
any network based upon the client/server model. 

Accordingly, as will be appreciated by those of ordinary 
skill in the art, as used herein, the term "client" refers to a 
client computer (or machine) on a network, or to a process, 
such as a Web browser, which runs on a client computer in 
order to facilitate network connectivity and communica- 
tions. Thus, for example, a "client machine" can store and 
one or more "client processes/' The term "user" is used to 
broadly refer to one or more persons that use a particular 20 
client machine. 

FIG. 1 illustrates a known computer network based on the 
client-server model, such as the Internet. The network com- 
prises one or more "servers" 10 which are accessible by 
"clients" 12, such as personal computers, which, in the case 25 
of the Internet, is provided through a private access provider 
14 (such as Digital Telemedia in New York City) or an 
on-line service provider 16 (such as America On-Line, 
Prodigy, CompuServe, the Microsoft Network, and the like). 
Each of the clients 12 may run a "Web browser", which is 30 
a known software tool used to access the Web via a con- 
nection obtained through an Internet access provider. The 
servers allow access to various network resources. In the 
Internet, for example, a Web server 10 allows access to 
so-called "Web sites" which comprise resources in various 35 
different formats. A location of a resource on a server is 
identified by a so-called Uniform Resource Locator, or URL. 

Tne "World Wide Web" ("Web") is that collection of 
servers on the Internet that utilize the Hypertext Transfer 
Protocol (HTTP). HTTP is a known application protocol that 40 
provides users access to resources (which can be informa- 
tion in different formats such as text, graphics, images, 
sound, video, Hypertext Markup Language-" HTML" etc., as 
well as programs). HTML is a standard page description 
language which provides basic document formatting and 45 
allows the developer to specify "links" to other servers and 
files. Links are specified via a Uniform Resource Locator or 
"URL". Upon specification of a link, the client makes a 
TCP/IP request to the server and receives information that 
was specified in that URL (for example another "Web page" so 
that was formatted according to HTML) in return. The 
information returned may be generated in whole or in part by 
a program that executes on the server. Such programs are 
typically known as CGI (Common-Gateway-Interface) 
scripts and can be written using known programming lan- 55 
guages or methods that the server supports, such as PERL or 
C++. A typical Web page is an HTML document with text, 
"links" that a user may activate (e.g. "click on"), as well as 
embedded URLs pointing to resources (such as images, 
video or sound) that the client must fetch to fully render the 60 
Web Page in a browser. These resources may not be located 
on the same server that the HTML document was sent from. 
Furthermore, HTTP allows for the transmission of certain 
information from the client to a server. This information can 
be embedded within the URL, can be contained in the HTTP 65 
header fields, or can be posted directly to the server using 
known HTTP methods. 
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FIG. 2 is a block diagram of a representative "client" 
computer. The same or similar computer can also be used for 
each of the servers. The system unit 21 includes a system bus 
31 to which various components are coupled and by which 
communication between the various components is accom- 
plished. The microprocessor 32 is connected to the system 
bus 31 and is supported by a read only memory (ROM) 33 
and random access memory (RAM) 34. The ROM 33 
contains, among other code, the basic input -output system 
(BIOS) which controls basic hardware operations such as 
the interaction and the disk drives and the keyboard. The 
RAM 34 is the main memory into which the operating 
system 60 and application programs, such as a Web browser 
62, are loaded and cached 63. The memory management 
chip 35 is connected to the system bus 31 and controls direct 
memory access operations, including passing data between 
the RAM 34 and the hard disk drive 36 and the floppy disk 
drive 37. The CD ROM 42, also coupled to the system bus, 
31, is used to store a large amount of data, e.g., multimedia 
programs or large databases. 

Also connected to the system bus 31 are various I/O 
controllers: the keyboard controller 38, the mouse controller 
39, the video controller 40, and the audio controller 41. The 
keyboard controller 38 provides the hardware interface for 
the keyboard 22, the controller 39 provides the hardware 
interface for the mouse (or other hand-operated input 
implement) 23, the video controller 40 provides the hard- 
ware interface for the display 24, and the audio controller 41 
is the hardware interface for the multimedia speakers 25a 
and 25b. A modem 50 (or network card) enables commu- 
nication over a network 56 to other computers over the 
computer network. The operating system 60 of the computer 
may be Macintosh OS, OS/2, A1X, BE OS or any other 
known operating system, and each client computer is some- 
times referred to as a "client machine", a client "computer", 
or simply as a "client." 

As noted above, the Internet includes a public network 
using the Internet Protocol (TCP/IP) and includes servers 10 
which are accessible by clients 12. When a Web browser 62 
is used to access a file on a server 10, the server 10 may send 
information including graphics, instruction sets, sound and 
video files in addition to HTML documents (Web pages) to 
the requesting client. 

In accordance with the present invention, a tracking 
program is embedded in a resource, such as an HTML 
document which is sent from a server to a client based on a 
TCP/IP request. The tracking program may originate on a 
different server than the resource, in which case it may be 
obtained by the client through a TCP/IP request to the other 
server. The tracking program executes on a client machine, 
and is stored, for example, in RAM. The tracking program 
may monitor various indicia, such as time, mouse events, 
keyboard events, and the like, in order to track a user's 
interaction with the Web page. Thus, the tracking program 
may simply monitor the amount of time the user spends 
interacting with the Web page, or may monitor details of 
choices (such as links) made by individual users within a 
particular Web page. 

In some cases, clients will "cache" a resource obtained 
over the network (or temporarily store a copy of the resource 
on the user's computer), and may use the cached copy of the 
resource instead of obtaining it over the Internet when the 
resource is needed at a later time (for example, in order to 
completely render a Web page). In such cases, neither the 
basic operations nor functions of the tracking program nor 
the transmission of tracked information to a server, differ 
from the cases where cached copies were not used. 
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In one embodiment of the present invention, a tracking information requested may be received in any order. When 
program is embedded in an HTML of a Web page and the tracking program has been obtained, the client process 
downloaded by a client. The tracking program may monitor (i.e., the Web browser) saves the tracking program to RAM 
operation of a peripheral input device connected to the client (S305). After any necessary initialization, the tracking pro- 
machine, such as a keyboard or mouse, keep a record of 5 gram initiates a software timer to monitor the amount of time 
which choices, if any, are made by a user, and may monitor the Web page is displayed (S306). When the client leaves the 
the length of time the user has displayed the Web page in Web page (S307), the tracking program calculates the 
addition to the lime spent interacting with a particular part amount of time the user has interacted with and displayed 
of it. While in the preferred embodiment, the tracking the Web page and sends this information to a server. Other 
program is embedded in an .HTML document, those' skilled available client information, such as the network ID and 
inline art will recognize4hat other mechanisms, are possible client ID, or so-called "Cookie" of the client, is also sent to 
for embedding the tracking program in the client hardware, the server (S308). If 'desired; other information concerning 
and* the " patent' is not limited to implementation as an me dienl computer may be automatically acquired and sent 
executable program embedded in' an HTML document. For t0 server, such as 1 the type of hardware in the client 
example, the tracking program may be downloaded and J5 computer and various resources that are resident on the 
installed in a client process, as' would "be the case for a c ^ em computer. 

so-called "plug-in" or "helper" application. Alternatively, Duc t0 ^ technical limitations imposed by the Internet, 

the tracking program can be built into a client application or ! he JAVA programming language was applied to the Internet 

client process such that it need not be separately downloaded ™ 1995 b _J. Prog»™«s * &m Microsystems, Inc. of 

and installed. In addition, the teachings of the present M ^unmn View, Calif For example ^some of the fundamen- 

invention are not limited to use on the Infernet or the World 20 tal ^ h ^J^f ^T'hTh P ro e rammers p a " d f^" 

wu c • ♦ *u * 1 • p*u * ncers arc portability, bandwidth and security. Portability 

Wide Web. For instance, tfie tracking program of the present ^ QW& th / same J ecuU51e code t0 mn J QS& multi * 

invention may be utilized on a so-called Intranet . operating systems. Bandwidth specifies the amount of infor- 

As noted above, a client process, such as a Web browser mation that can transfer across the network at any time. For 

running on the client machine, uses a TCP/IP connection to M instance, high-speed lines categorized as Tl through T3 can 

pass a request to a Web server running an HTTP servi ce (or transmit data at 1.544 through 45 megabits per second, 

"daemon" under the UNIX operating system). The HTTP ISDN lines can transmit data at rates of 64 through 128 

service then responds to the request, typically by sending a kilobits per second, and standard phone lines, over which 

Web page formatted in the Hypertext Markup Language, or most users transmit data, currently transmit using modems at 

HTML, to the browser. The browser displays the Web page 30 approximately 28.8 kilobits per second. In the case of a 

using local resources (e.g., fonts and colors). Unless the 640x480 pixel window on a computer display that is capable 

tracking program is already resident in the client, it is of displaying images in 256 colors (which requires one byte 

embedded in the Web page and downloaded to the client per pixel), in order to display the window's contents requires 

along with the Web page. The tracking program is executed 307,200 bytes of data. To create an animation, programs 

after any required initialization has occurred. The tracking 35 typically display 15 to 30 different images per second. Given 

program may monitor the length of time the user remains in a 640x480 window, 15 to 30 frames per second would 

the Web page, or any one or more portions thereof, and may require 4,608,000 to 9,216,000 bytes per second. Because 

track some or all mouse and keyboard events to provide many users are currently browsing the Web using 28.8 

meaningful data to the server concerning the user's interac- kilobit (or slower) modems, there is simply not enough 

tion with the Web page. 40 bandwidth to download animation screens. As a result, many 

In its simplest form, the tracking program is a timer Web sites today resemble magazines whose images are for 

program linked to an HTML document and is downloaded the most part static (unchanging). However, to satisfy an 

and executed on a client when the HTML document is audience that spends many hours in front of dynamic 

served to the client in response to a client TCP/IP request. television images, Internet programmers and engineers must 

During or after the client formats and displays the Web page 45 provide a way to animate Web sites. One solution is to 

specified by the HTML document, the tracking program download programs written in the JAVA programming lan- 

begins a software timer to monitor the amount of time the guage that implement the animation. 

Web page is displayed on the client computer. Animation is only one example of the use of JAVA. Using 

When the user leaves the Web page (for example, by JAVA, programmers can create stand alone programs similar 

exiting the Web page or "clicking" on a link to another 50 to those that programmers can develop using C++, and can 

resource on the same or another server), the tracking pro- also create so-called "applets" that run within a Web 

gram sends the monitored time to another computer on the browser. To address security issues, JAVA developers 

Internet for storage and analysis. ensured that a programmer could not develop a computer 

As illustrated, for example, in FIG. 3, the client issues a virus using a JAVA applet and that an applet could not 

TCP/IP request for a Web page located on a Server A (S301). 55 arbitrarily transfer information concerning a user's system 

After a handshaking period, the Server A begins to send the (such as a file on the user's system) back to the server. Thus, 

HTML formatted document, which* contains an embedded JAVA applets have limited operations. For example, a JAVA 

URL referencing the tracking program. The client addition- applet generally cannot currently read or write files on the 

ally issues a TCP/IP request to the Server B referenced by user's system. In this way, an applet cannot store a virus on 

the embedded URL in order to obtain the tracking program 60 a user's disk or arbitrarily read information stored on a 

(5302) . The client also makes any other TCP/IP requests user's disk. In addition, for other security and stability 

(5303) to obtain any other resources (such as images, video reasons, JAVA developers eliminated or changed many 
or sound) needed in order to fully render the Web Page features of the C and C++ programming languages, such as 

(5304) . Each of such resources are typically referenced by pointers, with which advanced programmers could bypass 
individual URLs embedded in the HTML document. These 65 JAVA'S security mechanisms. 

requests need not occur in any specific order and may JAVA applets run within a "JAVA-enabled client", such as 

reference resources located on any server. In addition, the Netscape Navigator version 2.0 (Windows 95 or Windows 
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NT versions only) or later, or Microsoft's Internet Explorer 
version 3.0, or later. In addition, since most users browse 
with personal computers running Windows, Macintosh, 
UNIX-based systems, and the like, the JAVA developers 
designed JAVA to be portable, or "platform-independent". 5 
Thus, the same JAVA applets can be downloaded and run in 
any JAVA-enabled client process, irrespective of the plat- 
form type. 

JAVA applets can be used by developers to create 
sophisticated, fully interactive multimedia Web pages and 10 
Web sites executable on any JAVA-enabled client. Repre- 
sentative JAVA applets are disclosed, for example, by 0. 
Davis, T, McGinn, and A. Bhatani, in Instant Java Applets, 
Ziff-Davis Press, 1996. 

Since JAVA provides the ability to download complex l5 
programming instructions in the form of applets that are 
executable by a JAVA-enabled Web browser, the tracking 
program of the present invention may be implemented in the 
JAVA programming language. As will be readily appreciated 
by those of ordinary skill in the art, however, the teachings 20 
of the present invention are not limited to JAVA applets or 
to the JAVA programming language whatsoever. In connec- 
tion with the Internet, for example, the present invention 
may also be implemented in a so-called "Active-X" 
environment, in which the tracking program is written as an 25 
Active-X component. 

As will be further appreciated by those of ordinary skill in 
the art, security restrictions may, in some cases, prevent one 
from having direct access to information stored on a client's 
hard disk, such as client IDs. In such cases, other means may 30 
be used to obtain this information. For example, when a Web 
browser makes a request for information from a server it 
typically includes certain information about the client in the 
"HTTP request header." The server receiving the request can 
obtain and store this information using known means 35 
implemented, for example, in a so-called "CGI script" 
executable on the server. Therefore, one way of obtaining 
client identifying indicia is to embed a request in the HTML 
file for another resource on a server that will obtain and store 
the indicia. This resource may be a program (such as a CGI 40 
script) that captures relevant information and stores it. This ' 
information can then be combined with information moni- 
tored by the tracking program to provide a more detailed 
knowledge base. This embedded request may be in addition 
to the embedded tracking program. Representative CGI 45 
scripts capable of capturing client identifying indicia are 
disclosed by A. Gundavaram, in CGI Programming on the 
World Wide Web, O'Reilly Press, 1996. 

In order to store client-identifying indicia, such as a user's 
network ID (IP) and client ID numbers (cookies) and 50 
associated tracking information, a database is set up on a 
server. This may be done in any known manner, such as by 
using a commercially- available database program designed, 
for example, for the high-speed processing of large data- 
bases. In the case of the tracking program described above, 55 
the information stored in the server database may include the 
network ID, client ID, the associated link (the URL of the 
Web page), the amount of time the user spent interacting""' 
with the Web page, and anyseleclions or choices made by 
the user while interacting with the Web page. Thus, the 60 
above-described tracking program permits Web site admin- 
istrators and Internet advertisers, for^example, to determine 
not only the number of user visits orhits made to a particular 
Web page, but also permits the accurate determination of the 
length of time users have displayed and/or interacted with 65 
their Web page. This is invaluable information to Internet 
advertisers, among others, and permits advertisers to make 
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informed decisions as to the effectiveness and value of 
particular Web pages and/or ad banners. 

A more 'particular embodiment of this aspect of the 
invention is illustrated in FIG. 4.' A Web page (or HTML 
document) is requested by the client from a first server A, 
using TCP/IP and HTTP protocols (S401). This HTML 
document contains text, as well as embedded URLs that 
point to graphical images (e.g. GIF format image files) also 
located on the first server A. The images; in general, may be 
located on any HTTP server on the Internet. These images 
are embedded inside " the Web page using the known 
HTML<IMG> tag, which allows one to specify the source 
URL for an image, as well as additional information such as 
size and other layout parameters. These images will then be 
fetched by the client using TCP/IP and HTTP protocols from 
Server A (S402) and rendered on the browser (S405). The 
Web page (or other Web or HTML document) additionally 
includes embedded URLs which point to two resources that 
reside on a second server "B". One of the resources is an 
executable program; which executes- on Server B, and is a 
CGI script. This resource is also embedded inside the Web 
page using the <IMG> tag. Thus, in attempting to render the 
Web page, the client will automatically fetch this resource 
(S403), which forces execution of the CGI script on the 
second Server B and the return of information output from 
the script to the client. In this case, the information returned 
to the client is formatted as an GIF image type which is 
extremely small as, well as completely transparent (S403B). 
When the CGI script executes, it may collect information 
from the HTTP request header such as browser type, net- 
work ID (IP address), and if set, client ID ("cookie"), as well 
as any additional available information such as time of 
execution and the URL of the Web pa^e T and store it in a 
liaTabase^ror example using SQL (S403A, S404). In step 
S403B, the CGI script returns information to the client, 
which includes a response header which indicates (among 
other information^, that the return type is an image, that this 
resource should not be cached by the client, and if no client 
ID is set and the client supports it, that a client ID is to be 
set to a value generated by the script. 

In addition, the CGI script may monitor ( the number of 
times the Web page has been accessed in general. On the 
other hand, another CGI script located on the same or 
another server may be used for this purpose. This process 
may be carried out by simply incrementing a counter each 
time the resource is accessed, or may be conducted at any 
other time by merely counting the number of entries made 
in a stored record of requests made for the resource. 

The other resource located on Server B is a JAVA applet, 
the tracking program-JThis resource can also be located on 
any other server, and is embedded in the Web page using the 
known HTML<APPLET> tag, which allows one to specify 
the source URL (through the CODE and CODEBASE 
parameters) as well as additional size, layout and initializa- 
tion parameters. The client, in attempting to 1 render the Web 
page, will automatically fetch the applet by making a request 
to Server B using the TCP/IP and HTTP protocols (S406). 
Soon after it has received the JAVA code for the tracking 
program, it will first execute the IN1T (initialization) method 
of the applet (S407) and then the START method. The 
START method will make note of the current time using 
standard JAVA methods (S408). The STOP method of the 
applet which is executed, for example, when the user leaves 
the Web page (S409), will compute the difference between 
the current time and the time noted during execution of the 
START method. This difference, which is the time between 
execution of the STOP and execution of the START 
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methods, is sent to the Server B for storage and analysis obtain any other resources (such as images, video or sound) 

(S410). The information can be sent using standard JAVA needed in order to fully render the Web Page (S502A). Each 

network methods, such as opening a URL connection to a of such resources arc typically referenced by individual 

second CGI script on Server B (or any other server) designed URLs embedded in the HTML document. These requests 

to capture the tracked information (S410A). This second 5 need not occur in any specific order, and the information 

CGI script can then obtain any information tracked and requested may be received in any order. When the tracking 

transmitted by the applet as well as any available informa- program has been obtained, the client process (i.e., the Web 

lion in the HTTP request header. This information can be browser) saves the tracking program to RAM (S503B). In 

stored in a database on Server B or elsewhere. If necessary, this case > the tracking program commences a software timer 

the information stored by both scripts may be combined into 10 u P on mc detection of a predetermined user action (S504). 

one or more complete databases. As will be understood by When the user performs another predetermined action 

those of ordinary skill in the art, acquisition of information (S505), the tracking program calculates the amount of time 

by the server need not be conducted using CGI scripts. For between the predetermined user actions, and sends this 

instance, this information may be acquired by any other information, along with other available client information, to 

server-resident process designed for this purpose, or may be 15 the server (S506). 

uploaded by the tracking program or other client-resident Thus, for instance, the software timer of the tracking 

process, such as by a direct connection to a resource located program may be used in monitor the amount of time a user 

on a server (i.e., a database), or by using any other known spends interacting with a portion of a Web page. For 

process. example, if the Web page is provided with an interactive 

The database thus constructed can be indexed by resource 20 resource sucn ^ a g ame or an information resource activated 

identity and may contain information about users who have by clicking on a particular button, the tracking program may 

visited the Web page, such as their network and client IDs, determine how long a user has interacted with such a 

how often they visited the Web page, hawjong the Web page selection. In the case of a Web page provided with an ad 

wa5_displa^e4 a _and so on. Additionally, if the above- banner, the tracking program can be designed to monitor the 

mentioned tracking mechanism is implemented across vari- 25 amount of time a user has interacted with the ad banner, 

ous Web pages in a particular Web site, the database thus The tracking program may be used not only to monitor the 

constructed may contain similar information about the dif- time spent by a user in a Web page or an ad banner, but may 

ferent Web pages in the Web site. Similarly, the information also be used to create a more complex "historical" user 

acquired by the tracking program may be combined' with a profile to permit the server to assemble a Web page or target 

process for monitoring the number of times the Web 30 an ad banner based upon the diverse interests of respective 

resource has been accessed. An analysis of the data on a users. 

user-indexed basis would facilitate the determination of For example, when a user is exposed to an ad banner 
individual user interests and the like. On the other hand, having information targeted to their particular interests, the 
analysis of the data on a resource-indexed basis would allow user is more likely to interact with that ad banner for a longer 
the determination of, for example, which Web pages are 35 period of time and on a more frequent basis, thereby 
viewed the longe'st and/or most' often either by users in increasing the value of that ad banner. In accordance with 
general, or by specific users. Thus, it would be possible to the present invention, in order to learn the particular interests 
determine if there were different types of users that preferred of respective users, an ad banner may include specific 
different sections of the Web site (because, for example, they information permitting the user to interact in different ways 
spent more time browsing differenfsections of the Web site). 40 with the banner. The ad banner may have pulldown menu 
Additionally, if the above<nentioned" tracking program is options, clickable buttons or "hot-spots", keyboard input, or 
attached to an ad banner that' is embedded in multiple Web any number of input mechanisms, whose selection or action 
pages across different Web sites (as is typically the case with upon in a designated manner causes corresponding events to 
ad banners), the- database thus constructed may 'contain take place in the ad banner such as the generation or 
information about how. often and for how long the different 45 synthesis of sounds, the display of images, video, or graphic 
pages that contained the ad banner were displayed, as well animations, or the presentation of different types of Mor- 
as more specific information about users that visited those mation to the user, perhaps with additional choices. Such 
pages. With this information, advertisers could determine information may, for example, include links to interactive 
the accuracy of data supplied to them by Web site admin- games, links to entertainment information, sports-related 
istrators ^ about the number of times their ad banner was 50 games and/or trivia, and the like, or information concerning 
displayed, as well as learn howJong_the_Web page contain- particular goods and services, or means by which to order or 
ing the ad banner was displayed — a number that would be of purchase specific goods and services. The more choices that 
great use in determining the effectiveness of their advertis- are made available, the more information that can be 
ing. , acquired concerning the user's particular interests. Of 
In another embodiment, the softwar e timer of the tracking 55 course, an unlimited number of possibilities are available, 
program may be initiated or stopped when the user incurs a depending upon the application, and an exhaustive listing of 
keyboard or mouse event, such as by "clicking" on a such possibilities cannot be provided herein, 
specified area of an ad banner. This is illustrated in the In this case, the tracking program is downloaded, as 
flowchart shown in FIG. 5. Operation of the system in this described above, with the HTML document in response to a 
embodiment is similar to that shown in FIG. 3. Thus, the 60 TCP/IP client request. As above, the tracking program may 
client first issues a TCP/IP request (S501). After a hand- monitor the amount of time the user spends displaying both 
shaking period, a first Server A begins to send an HTML the Web page and the ad banner embedded in the Web page 
formatted document, which contains an embedded URL as a whole, but also monitors the user's interaction with the 
referencing the tracking program. The client additionally Web page and the ad banner, such as by monitoring each of 
issues a TCP/IP request to a second Server B referenced by 65 the choices made by the user within the Web page and ad 
the embedded URL in order to obtain the tracking program banner. Thus, for example, if an interactive sports-related 
(S502B). The client also makes any other TCP/IP requests to game is included in the Web page, the tracking program will 
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determine if a user has played the game, what his or her information (S607A), such as network and client IDs, any 

score was, how long they played the game, and any other other information such as the URL of the Web page, as well 

possible information. If a choice of different games, each as information captured by the CGI script 1, and the previ- 

directed to a different interest, are made available to users ous \ y constructed historical database profile (S607B), dif- 

within the same ad banner, it is possible to determine what 5 ferent inform ation (images, sounds, text, etc.) may be 

is of most interest to the user by the selection of the game. rct urned to the applet. Such information can therefore be 

In addition, the ad banner may be provided with multiple selected by the scripts based on Network and/or Client ID, 

links to other, diverse Web sites, such as Web sites relating thc URL of mc Wcb page> and thc prcviously constructed 

to sports, entertainment, general information, technology, clienl profile ^ may 5e accomplished in the manner 

history, and the like. The tracking program monitors which 10 d escr ib e d above. 

of the various links are selected and provides this informa- ~ C ™ D lU a t *u i* . j c 

. A j. « ■ ■ *. i_i 1- . The STOP method of the applet which is executed, for 

lion to the server. As discussed above, other available chent , , 4U . *T. ... , /cfAnv ' ... 

. - . , , , , tx. ' ' c «• example, when the user leaves the Web page (S609), will 

information may also be sent to the server. This information t \u j cp u. . \ .u 

j * l j i_ compute the dure re nee between the current time and the 

is sorted and stored m the server database and may be r t , , . - , 0 ™, A ™ , 

, , « ' time noted during execution of the START method. This 

analyzed manually or automatically. is m wbkb % the time between execulion of tht STOp 

The tracked information may be used to assemble ind exeeulion of , he START me , hod {s ^ , 0 , he Smef 

resources geared toward the user s interests. Based upon the B for s , and ana , ^ (S610) ^ mformation can ^ 

historic user profiles created in the server database, down- scntusingslandard j A VA network methods, such as opening 

loading of information to the same client on a subsequent a URL conneclion , 0 a second CGI xript on Server B 

vKil to the same or different Web page may be done on a 20 desi d , 0 c the tracke(J information (S610A) 

more intelligent basis. For example, users who have previ- S610B) |n S610A> , he md CGl x{i , ma ob , ain 

ously expressed an interest in sports-related tnvia (as indi- information acquired by the tracking program (i.e., the 

cated by the ir previously tracked behavior) may be served JAVA ^ ^ weU M c , jent iden(ifying indicia transmit . 

with information targeted to audiences interested in sports. , ed b me clicnt> such 

as in the HTTP request header. This 

Similarly, users who have expressed greater interest in M mformation can be stored m a database on Server g , f 

technology may be served with technology-related informa- necessary> tne m f onnat jon stored by both scripts may be 

tion that would be of much less interest to other users. Trie combined ml0 one more completed databases, 

assembly of a resource such as a Web page may be easily ... A » . . ... 

accomplished. For example, the HTML document of the A In | tms ^^odiment of the present invention, two distinct 

Web page may include a plurality of embedded resources. 30 databases ^ be ^d. The first database is indexable by 

Previous choices made by a user on a particular chent res ° urce ^entity. ^ h » ™ L )> and includesin formation 

computer and stored in a user profile database may be used such « ^ of th % W * b document, number of Umes 

to determine which of the resources is to be downloaded to "cessed ldentU y ° f <r hcnl * "j* acccssed ™ 

that cUent using simple logical processing instructions. For document, amount of time displayed, amount of data 

instance, a user profile which indicates that a user has a 35 dls P la y cd > av , e J a f.? time displayed, number of times 

greater interest in sports-related information than in histori- aCC k eSS />/ nd ^ ^ ca f of an ad banne ' °J oth f 

cal information may be used to download sports-related embedded resource which may be accessed by a link made 

resources, such as GIF-type images and advertisements. ^ a T*^ * TT* T ' ^ d k ltaba f m »y 

Since the user has previously expressed a greater interest in m u clude additional information such as ekek-through rate 

sports, sports-related advertisements may therefore be tar- 40 ( ! he °T °f ****** ad banne ' was <* cked 0D t0 S° to 

geted to that user. the Web Slte of the advertiser), and the like. 

A particular implementation of this mechanism is illus- A second database that mav be created * indexable by 

trated in FIG. 6. A Web page is requested by the client from ^dividual client, and includes information concerning indi- 

Server A(S601). This Web page contains text, as well as vidual c]ieai ' s interests and preferences. These separate 

embedded images which must be fetched from Server A 45 databases ma V be combined in a single database indexable 

(S602) and rendered (S605). In addition, the Web page b X chent or resource identity. 

contains embedded URLs that point to two resources on In another embodiment, illustrated in FIG. 7, the tracking 

Server B. The first resource is a first CGI script 1, which is program is used to create a database of information about a 

embedded inside the Web page using the standard Web site (or, if desired, across multiple Web sites on 

HTML<IMG> tag (S603). In attempting to render the Web 50 multiple servers). In this case, the same tracking program is 

page, the client will automatically fetch the resource asso- embedded in multiple Web pages served up by the same 

ciated with the <IMG> tag on Server B, which will result in Server A. The tracking program in general originates from a 

execution of the CGI script 1. This CGI script lean capture Server B (but may also originate from Server A). The 

client information such as Network ID or Client ID (S603A). tracking program will monitor the time the Web page was 

The CGI script also returns a transparent image (S603B). 55 displayed, and may capture any other information available 

The other resource on Server B is a Java applet, which is to it. For example, the tracking program can determine the 

a combination ad banner and tracking program. This may be UR L of the Web page it is embedded in and may determine 

stored on any server. In attempting to render the Web page, the amount of informalion downloaded by the client, 

the chent will automatically fetch the Java code (S604), In particular, a Web page is requested by the client from 

download, initialize, and start operation of the applet (S607, 60 Server A (S701). This Web page contains text, as well^as 

S608). After the applet is initialized, it contacts Server B to embedded images which must be fetched from Server A 

obtain other resources it needs in order to display images, (S702) and rendered (S705), In addition, the Web page 

play sounds, or control its overall look and behavior. In fact, contains embedded URLs that point to two resources on 

the applet may obtain these resources by executing one or Server B. The first resource is a CGI script, which is 

more CGI scripts or other processes that reside on Server B 65 embedded inside the Web page using the standard 

or elsewhere (S607). Based on information provided to these HTML<IMG> tag (S703). In attempting to render the Web 

scripts through standard HTTP methods, including client page, the client will automatically fetch the resource on 
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Server B, which will result in execution of a CGI script 1. In another embodiment, illustrated in FIG. 9, a client 915 
This CGI script 1 can capture client information such as requests an HTML document 920 from a Server Al 900. The 
Network ID or Client ID (S703A) and returns a transparent HTML document has an embedded reference to a resource 
image (S703B). The other resource on Server B is a Java comprising three components: a component for displaying 
applet. This may be stored on any server. In attempting to 5 images, a component for rendering animations, and a corn- 
render the Web page, the client will automatically fetch the poncnt for pr0 viding form fill-in functionality. 
JAVAcode store £ inRAM ^initialize and start operation of Aftef ^ ^ m hag te<| frQm Servef M m ^ 
the applet (S707). Hie START method of the applet is H ™ Ldocumem ^ m em 4 bedded reference to a resource> 
executed and the applet takes note of the current time nin ' 
(S708). Thereafter, the applet contacts the Server A and, if in mc chcn{ »™ f™* Server B 910 an executable 
security restrictions allow it, the applet queries the Server A 10 P m &™ » 5 - ^L^*} 915 ^ a scri P l 9 f? 
for the page it is embedded in, determines its size, as well as embedded in the HTML document requested from Server Al 
the URLs of other embedded resources (such as images or 90 °- Meanwhile, the executable program retrieves the client 
video), and requests header information about these ID 935 for the client 915 upon which the executable program 
resources in order to determine their size (S709). In this is running. 

case, the tracking program may determine the size of the 15 In the example illustrated in FIG. 9, the script associated 
fully rendered Web page, (i.e., the number of bits that must with the downloaded HTML document indicates that the 
be downloaded in order to fully-render the Web page). If the first component of the referenced resource is one which 
tracking program is part of a larger embedded application displays images on the client 940. Since this component is 
that displays information downloaded from a server (such as stored on Server A2 905, the client 915 downloads the 
a live news feed applet), the tracking program can also 20 functionality and data required to execute the first corapo- 
monitor the amount of information downloaded and dis- nent Q f ^ resource 94Q 
played by the applet. Before or as the user leaves the Web . ' . 
page (S710), the tracking program can transmit this infor- For P«n"»s of this example, it is assumed that the first 
ration to Server B for sforage and analysis (S711, S711A, component of the embedded resource allows for user inter- 
S711B). In this manner, it is possible to build a database of 25 actlon - As the user interacts with the first component, the 
accurate information concerning how often different pages component keeps track of the user's interaction. At some 
of a Web site are requested, how long t^ey are displayed, and point, the user interaction may determine that it is appro- 
how much information was downloaded. This information P riate to start the second component of the embedded 
would be of use to Web site administrators in order to judge resource. Since the second component is stored on Server A2 
the popularity of different Web pages, as well as for example 3Q 905, the client 915 next downloads the second component of 
to set advertising rates for any embedded advertisements. the resource 945. In this example, the second component of 
In another embodiment, illustrated in FIG. 8, a client 815 *e embedded resource provides animation functionality to 
requests an HTML document 820 from Server Al 800. The lhc clienL ^ second component, thus downloaded and 
HTML document 820 has an embedded reference to a functioning 950 on the client 915, tracks user interaction 
resource. The resource comprises two components, each of 35 witn ^ second component. 

which are stored on Server A2 805. Once the client has When the user interaction with the second component so 
requested the HTML document 820 from Server Al 800, the indicates, via the logic of the script associated with the 
client downloads an executable program 825 from Server B HTML document 930, the client will download the next 
810. While this embodiment illustrates the case where the component of the resource; in this example, the third corn- 
executable program is stored on Server B 810, it will be 40 ponent of the resource is the functionality to fill in forms, 
understood by those skilled in the art that the executable Thus, the client 915 downloads from Server A2 905 the 
program may also reside on Server Al 800, Server A2 805, resource's component which provides form fill-in function- 
or any other server coupled to the client 815. Alternatively, ality 955. Once this third component is downloaded, the 
the executable program may reside on the client 815 and client 915 renders the form 960 and allows user interaction 
thus need not be downloaded before being run on the client 45 for filling in the form 965. The form fill-in component tracks 
815. In a preferred embodiment, the executable program is this user interaction with the component, 
an applet. As each component collects user tracking information, the 

Once the executable program has been downloaded 825, information is retrieved by the executable program 975. 

the client program retrieves the client ID 830. Next, the At some point, the user's interaction will indicate that the 

client retrieves the resource ID 835. The client then requests 50 user has left the HTML document which contained the 

the first component of the resource 840, which is stored on resource for which the three components were required 970. 

Server A2 805. The client 815 then renders and/or executes The executable program, having recognized that the user has 

the first component 850 of the resource, allowing for user left the HTML document embedded with the resource, tags 

interaction with the first component. The executable pro- all of the retrieved tracking information from the compo- 

gram tracks the user's interaction with the first component 5S nents with client ID and resource ID information 980. The 

845. The client 815 requests the second component of the appropriately tagged tracking information is then sent by the 

resource 855 and the executable program tracks the user's client program 980 to Server B 910. On Server B 910, there 

interaction with the second component of the resource 860. is running a script 985 which captures the executable 

Once the user interaction is complete 865, the executable program data sent from the client 915 and stores it in 

program packages the client ID, the resource ID, and the 60 database 990 for storage and possible later analysis, 

resource use data 870. This packaged data is then sent 880 While the example illustrated in FIG. 9 has been 

to Server B 810 for storage and analysis. described as one in which the individual components of the 

While this embodiment illustrates an example wherein resource do the tracking of the client interaction such that the 

both the first and second components of the resource are client interaction tracking data generated by each of the 

located on the same server, A2 805, those skilled in the art 65 components is in the proper format, except for being tagged 

will recognize that the individual components of a resource with appropriate ID tags, for storage in the database 990 

need not be stored on the same server. stored on Sever B 910, those skilled in the art will recognize 
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that the components may be designed such that some or all 
of them do not generate data about each user interaction 
which is usable in the database 990, but rather pass the 
interaction data to the executable program 975. The execut- 
able program then must generate data pertaining to the user 5 
interactions in a format suitable for storage in the database 
990. 

The example illustrated in FIG. 10 is closely associated 
with the one illustrated in FIG. 9; the point of difference 
between the two is that in the example illustrated in FIG. 10, 10 
the executable program may decide at a given time, by way 
of example: based on time, user interaction, request of 
Server B 910, or instructions contained in the HTML 
document, to collect tracking information from each of the 
loaded components 1075. When the executable program 15 
decides to collect the interaction information, it will then tag 
all of the collected tracking information with ID data 1080 
and send the tagged information to Server B 910 for storage 
in the database 990. That is, in the example illustrated in 
FIG. 10, the executable program does not necessarily wait 20 
for the user to leave the HTML document with the embed- 
ded resource before collecting the interaction information, 
tagging it with ID information, and sending it to Server B 
910; the executable program may collect the interaction 
information 1075, tag it with ID information 1080, and send 2f 
it to Server B 990 based on an event other than the user 
leaving the HTML document with the embedded resource. 

As with the example illustrated in FIG. 9, the script 
associated with the downloaded HTML document 930 indi- 
cates that the first component of the referenced resource is 30 
one which displays images on the client 940. Since this 
component is stored on Server A2 905, the client 915 
downloads the functionality and data required to execute the 
first component of the resource 940. ^ 

As with the example illustrated in FIG. 9, it is assumed 
that the first component of the embedded resource allows for 
user interaction. As the user interacts with the first 
component, the component keeps track of the user's inter- 
action. At some point, the user interaction may determine 4Q 
that it is appropriate to start the second component of the 
embedded resource. Since the second component is stored 
on Server A2 905, the client 915 next downloads the second 
component of the resource 945. As in the example illustrated 
in FIG. 9, the second component of the embedded resource 45 
provides animation functionality to the client. The second 
component, thus downloaded and functioning 950 on the 
client 915, tracks user interaction with the second compo- 
nent. 

When the user interaction with the second component so 50 
indicates, via the logic of the script associated with the 
HTML document 930, the client will download the next 
component of the resource; in this example the third com- 
ponent of the resource is the functionality to fill in forms. 
Thus, the client 915 downloads from Server A2 905 the 55 
resource's component which provides form fill-in function- 
ality 955. Once this third component is downloaded, the 
client 915 renders the form 960 and allows user interaction 
for filling in the form. The form fill-in component tracks user 
interaction with the component. 60 

At some point, based on, for example and not by way of 
limitation, the time, a request from Server B 910, a particular 
user interaction, or instructions contained in the HTML 
document in which the resource is embedded, the executable 
program collects tracking information from each of the 65 
loaded components 1075. The executable program then tags 
the collected tracking information with client ID and 
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resource ID information and sends the appropriately tagged 
tracking information 1080 to Server B 910. On Server B 
910, there is running a script which captures the executable 
program data sent from the client 915 and stores it in 
database 990 for storage and possible later analysis. 

In another embodiment, illustrated in FIG. 11, a client 
1115 requests an HTML document 1120 from a Server Al 
1100. The HTML document has an embedded reference to 
a resource comprising three components: a component for 
displaying images, a component for rendering animations, 
and a component for providing form fill-in functionality. In 
this example, we are interested in collecting interaction data 
about only two of the three components which comprise the 
resource: the component providing animation functionality 
and the component providing form fill-in functionality. 

After the client 1115 has requested from Server Al 1100 
the HTML document with an embedded reference to a 
resource, the client 1115 downloads from Server B 1110 an 
executable program 1125. The client 1115 then executes a 
script 1130 embedded in the HTML document requested 
from Server Al 1100. Meanwhile, the executable program 
retrieves the client ID 1135 for the client 1115 upon which 
the executable program is running. 

In the example illustrated in FIG. 11, the script associated 
with the downloaded HTML document indicates that the 
first component of the referenced resource is one which 
displays images on the client 1140. Since this component is 
stored on Server A2 1105, the client 1115 downloads the 
functionality and data required to execute the first compo- 
nent of the resource 1140. 

For purposes of this example, it is assumed that the first 
component of the embedded resource allows for user inter- 
action; however, since in this embodiment we are not 
interested in tracking user interaction with the first 
component, the first component does not track user interac- 
tions. At some point, the user interaction may determine that 
it is appropriate to start the second component of the 
embedded resource. Since the second component is stored 
on Server A2 1105, the client 1115 next downloads the 
second component of the resource 1145. In this example, the 
second component of the embedded resource provides ani- 
mation functionality to the client. The second component, 
thus downloaded and functioning 1150 on the client 1115, 
tracks user interaction with the second component. As the 
second component tracks user interactions, it may decide to 
send user interaction information to Server B 1110. If so, the 
second component will retrieve ID -tagging data from the 
executable program and appropriately tag the user interac- 
tion information with the ID-tagging data 1172. The second 
component will then send the tagged user interaction infor- 
mation to Server B 1110 for storage in database 1190 and 
possible later analysis. 

When the user interaction with the second component so 
indicates, via the logic of the script associated with the 
HTML document, the client will download the next com- 
ponent of the resource; in this example the third component 
of the resource is the functionality to fill in forms. Thus, the 
client 1115 downloads from Server A2 1105 the resource's 
component which provides form fill-in functionality 1155. 
Once this third component is downloaded, the client 1115 
renders the form 1160 and allows user interaction for filling 
in the form. The form fill-in component tracks this user 
interaction with the component. 

As with the second component, the third component 
tracks user interactions and may decide to send user inter- 
action information to Server B 1110. If so, the third com- 
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poneni will retrieve ID-tagging data from the executable It should also be appreciated that while the preferred 

program and appropriately tag the user interaction informa- embodiments of the tracking program use a single database 

tion with the ID-tagging data 1170. The third component to store the information, multiple databases could be used to 

will then send the tagged user interaction information to s t 0 re and process the information. 

Server B 1110 for storage in database 1190 and possible later 5 In additiorjj whilc in thc preferred embodiments of the 

analysis. tracking program the server that originated the tracking 

In yet another embodiment the tracking program is used ^ ^ rc&idc Qn ^ samc ^ 

to assemble a bill for the user s access to information For ^ ^ a irement of lhe { invention . ^ database 

example, users who have access to a live news or entertain- • . j -j * u- c *u * w u 

ment feed may be charged according to the amount of 10 ™* re * de on a sc P ara ! e m ^ mc from that which 

information displayed, either according to bit size or time, or 10 ?™* ^ tracking program. Similarly, while ,n the pre- 

both. Imagine that the tracking program is attached to a live ferred embodiments the server that originates the network 

feed applet. The tracking program monitors the lime the resource, or Web page (Server A), and the server that 

information is displayed and the amount of bits downloaded originates the tracking program (Server B) are different 

and automatically transmits this information back to a server servers, this is not a requirement of the present invention, 

when the user leaves. Together with the user's ID (client and 15 The network resource (Web page) and the tracking program 

network), and billing information that the user was previ- may be served out by the same server, 

ously requested to enter, it is possible to determine the it should also be appreciated that while in the preferred 

correct charge for the user. Similarly, a user could be charged embodiments the tracking program uses the HTTP and 

and billed for time spent on a Web page, as well as amount TCP/IP protocols, other network data transmission protocols 

of information downloaded by him or her. 20 CQuld be uged ^ i m pi emenl ^ same functionality. 

The methods embodied in the invention may be used to Moreover, use of an HTML formatted Web page is not 

create web resources with so-called "persistent" state. That neC essary. The information supplied to the user may not be 

is, the tracking program, in addition to the client profile k ^ form of an HTML or Web document such as a Web 

database, may also be used to create a Web resource that ^ Cfm be som& other form of informationt ]n 

appears to automatically "remember the user s previous 25 additi ^ traddn m Qeed nQt be downloaded to 

interactions on the Web resource. This may be implemented ^ ^ frQm ^ ^ ^ ^ be an added module ^ thc 

as in FIG. 6. For example, consider a Web page with an 4 4 . „./ , 4 . 4 

lT j j j A a u-~u -i™ ;Z. rt .n#^o client apphcaUon or Web browser running on the client, or 

embedded Crossword program which also incorporates w & » 

i • u • xiiu <u may be stored elsewhere on the client machine. For 

tracking mechanisms. When the page is rendered and the J . 

o a \ • . , in thf > example, in the former case, added modules could be 

Crossword program commences, a user is able to use tne 30l • i « u L c lj 

, , j j * en * t « *u wu u plug-ins and m the latter case could be referred to as cached 

keyboard and mouse to fill in letters on the Web page based y 6 

i .u . j- i j a, tU «^ ,i,L ^ n ,v a , resources. In such cases, the client application or Web 

on clues that are displayed. At the same time, these choices . ' rt \ 

are tracked, along vri.h any other information including but browser would mclude appropriate means to enable ac Uva- 

not limited to time. Before or at the time the user leaves the Uon ° f ' he f ckm 8 P ro e ra , m and th , e Upl °?* n & °f a chent 

,., , *i_ 4 i j ■ r *• «* f profile based upon foe user s interaction with a Web page or 

Web page, the tracked information is sent to a server for 35 y y * & 

storage (S610). When the user later returns to that page, the network resource. 

network or client ID is used to automatically fill in the letters Moreover, although in the preferred embodiments it is 

in the crossword that were previously selected (As in envisioned that the network resource or Web page is down- 

S607-607C) loaded from a remote server, this is not a limitation of the 

Although the invention has been described in terms of ,0 Mention. The precise location of the target document or 

preferred embodiments, those skilled in the art will recog- se ™r is not important. For example, the target document 

nize that various modifications of the invention can be may even be located on the hard dnve of the chent machine, 

practiced within the spirit and scope of the appended claims. Also, while in the above-described embodiments, the 

Thus, for example, the scripts used to transfer data need not client profile is created automatically using information 

be CGI scripts but could be a dedicated server or a direct 45 acquired by the tracking program and one or more CGI 

connection to the database, such as using JDBC (Java scripts and is stored in the server database, the client profile 

Database Connectivity) to place data into the database. can be created in a different manner and/or supplemented by 

In addition, while the preferred embodiments have been additional information. For example, one such technique for 

described in connection with JAVA applets that are execul- creating a chent profile is through the use of HTML "fill-in" 

able on a client, the tracking of user interaction may be so form tags. In such cases, the client profile is created not by 

accomplished by a client executable program written in a the tracking program, but instead by the client. Based on the 

language other thaD JAVA For example, the teachings of the client profile, the server can serve out information targeted 

present invention may be accomplished using Active-X to the client's interest, as revealed by the fill-in form, 

components in conjunction with the Internet Explorer Web Also, while the preferred embodiments have been 

browser. In addition, the tracking program need not be a 55 described in the context of Web browser software, the 

program that executes on the client computer. For example, techniques of the invention apply equally whether the user 

the tracking program may comprise a CGI script located on accesses a local area network, a wide area network, a public 

a server. Upon execution of the CGI script, the time at which network, a private network, the Internet, the World Wide 

a Web page is downloaded may be determined. By modi- Web, or the like, and whether access to the network is 

fying Web browser software using appropriate instructions, 60 achieved using a direct connection or an indirect connection, 

the browser can be used to send a signal to the server that For example, in connection with the World Wide Web, the 

downloaded the Web page upon the occurrence of a prede- teachings of the present invention apply whether a network 

termined user operation (such as exiting the Web page or connection is obtained via a direct Internet connection or 

clicking on a link to another Web page or resource). In this indirecdy through some on-line service provider. Thus, the 

manner, a program running on the server can be used to 65 "computer network" in which the invention is implemented 

determine the total time period the user has interacted with should be broadly construed to include any computer net- 

and displayed the Web page. work in which one or more clients is connectable to one or 
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more servers, including those networks based upon the 9. The method of claim 1, wherein the step of transmitting 

client-server model in which a client can link to a "remote" the resource use data and resource identifier is performed by 

document (even if that document is available on the same Ihe executable program. 

machine, system, or "Intranet"). 10 - ^ method of claim 9, wherein the monitoring is 

It should also be appreciated that while in the preferred 5 performed by one or more executable elements of the one or 

embodiments the tracking program is downloaded with the more ^ond resources, and comprising the one or more 

„, , r .i? • r f(U executable elements transferring the resource use data to the 

Web page from the server, this is not a limitation of the executable ro ram 

invention. The tracking program need not be embedded tX ^ti. 6 ^u'^V i • -i u • j . 

. . . ■ . .» . • j j j 11. The method of claim 1 wherein the resource use data 

within an existing Web page, but rather may be embedded . , . . . , . , . „ , 

•„u- u u . * i / 1fl comprises data other than data used by the first component, 

within a Web browser or supported elsewhere within the 10 4 , , c . . . . . ' j . 

i- • ir t^l *l * 1 * . ■ 4 j 12. The method of claim 1 wherein the resource use data 

chent itself. Thus, the tracking program may be initiated . , , # . , t . . . 

u ii * «; u *f i j comprises data other than data used by the second or more 

whenever a call to a Web page or network resource is made, r 7 

such as when a search to a particular URL is initiated, or ^ ™ * , . c . . - , . 4t _ 

, • i . j imT • i ■ t 13. The method of claim 1 wherein the resource use data 

when a previously-stored URL is launched. . , 4 , A , , 4 , ~ A 

We claim* 15 com P rises data otner tncn data used by the first resource. 

1 t *" i u ■ 4 Ll 4 14. The method of claim 1 wherein the first program is 

1. In a network having one or more servers connectable to , i , j f ^ . r &• 

, A c ■ . downloaded from the first server, 

one or more clients, a method of monitoring interaction .i. j r i ■ ■■ i. ■ i c . 

t , , c t / ftL , *i_ c * 15. The method of claim 1 wherein the first program is 

through a first client or the one or more clients with a first , i j j . * , i- , r i_ ^ * »■ 

& - , _ - - i i ■. r „ downloaded to the first client as part of the first component 

resource, the first resource comprising a plurality of com- r c t r y 

... r . j • < j nn of the first resource, 

ponents and having a resource identifier associated 2U Ad . ™_ . , - . . - , . n , . 

f, 4 , j . . 16. The method of claim I wherein the first component of 

therewith, the method comprising: £ , ~ , r 

' r & the first resource comprises the first program, 

downloading to the first client a first component of the 17 ^ melhod of claim x comprising downloading the 

first resource from a first server; executable program from a third server, 
executing on the first client an executable program to 25 18. The method of claim 1, comprising downloading the 
download one or more second components of the first executable program from the second server, 
resource; 19. The method of claim 1, wherein the executable 
monitoring interaction through the first client with the first program is a part of the first component of the first resource, 
component of the first resource and with the one or 20. The method of claim 1, wherein the executable 
more second components of the first resource; 30 program is implemented in the Java programming language, 
generating resource use data; and 21. The method of claim 1, wherein the one or more 
transmitting from the first client to a second server second components are implemented in the Java program- 
information comprising resource use data and the ming language. 

resource identifier ^* ^ e method of claim 1, wherein the resource use data 

2. The method of claim 1, wherein the monitoring is 35 comprises data representative of a duration of time for which 
performed by one or more executable elements of the one or the fitst resource was used by the first client. 

more second components. 23 ' ^ method of claim 1, wherein the resource use data 

3. The method of claim 2, wherein the one or more comprises data representative of a duration of time for which 
executable elements transfer resource use data to the execut- thc first component, or a part or parts thereof, was used by 
able program upon use of the one or more second compo- 40 lne client. 

nents 24. The method of claim 1, wherein the resource use data 

4. The method of claim 2, wherein the one or more comprises data representative of a duration of time for which 
executable elements store resource use data upon use of the thc one or more second components, or a part or parts 
one or more second components and transfer the stored thereof > were used b y the first chent - 

resource use data to the executable program upon receipt of 45 25 - ^ method of claim 1, wherein the resource use data 

a request from the executable program. comprises data indicative of whether the first user used the 

5. The method of claim 2, comprising the executable resource. 

program transferring the resource identifier to the one or 26. The method of claim 1 wherein the network comprises 

more executable elements, and wherein the step of trans- a computer network. 

mining the resource use data and resource identifier is 50 27. The method of claim 1 wherein the network comprises 

performed by the one or more executable elements. a communications network. 

6. The method of claim 1 wherein the resource use data 28 - In a computer network comprising one or more 
comprises data generated in response to predetermined servers ™ d 0De or more clienls > a metbod of monitoring 
interactions with the first component of the first resource and interaction through a first client of the one or more clients 
predetermined interactions with the one or more second 55 Wlth a first ^source or at least one component thereof 
components of the first resource. obtained by the first client from at least one first server of the 

7. The method of claim 6 wherein the predetermined one or more servers, the method comprising: 
interactions with the first component comprise interactions downloading the first resource in a series of one or more 
with the first component other than interactions with the first components from the at least one first server to the first 
component used by the first component or the one or more 60 client; 

second components. downloading a tracking program from a server of the one 

8. The method of claim 6 wherein the predetermined or more servers to the first client, wherein the down- 
interactions with the one or more second components com- loading of the first resource or at least one of the 
prise interactions with the one or more second components components thereof causes the downloading of the 
other than interactions with the one or more second com- 65 tracking program; 

ponents used by the first component or by the one or more executing the tracking program on the client computer to 

second components. monitor interaction through the client computer with at 
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least one of the first resource, at least one of the 44. The method of claim 43, wherein the step of the client 
components thereof, and one or more second resources, requesting or using the first resource comprises the client 
the one or more second resources having been obtained retrieving the first resource from cache for one or more files 
by the first client from at least one server of the one or following a first file in the plurality of files, 
more servers as a result of interaction through the first s 45. The method of claim 43, wherein the step of request- 
client with at least one of the first resource, at least one ing a polity of files comprises requesting the files from a 
of the components thereof, and a second resource of the plurality of source servers. 

one or more second resources; ^.Th* mcthod of claim 40> whc rcin the step of requested 

storing resource use data associated with the monitored the second resource comprises requesting the second 

interaction; and 10 resourcc eacn time the one or more files is requested, 

communicating the resource use data to a server of the one 47 The melhod of daim 46> wherein the step of using the 

^n 0 ^ 016 1 - no u ■ a 1 1 second resource to monitor the number of times the first 

29. The method of claim 28, wherein downloading the fc trf Qf ^ ^ monit oring at the 

first resource comprises downloading a Web page. second servef me DUmber of {{mes ^ fesource ^ 

30 The mcthod of c ^laim 29, wherein the tracking program w requested to mm{{oT ^ number of ^ a fik 

is embedded in the Web page. containing an embedded reference to the first resource is 

31. The method of claim 28, comprising storing the requested or used 

resource use data in a server of the one or more servers ^ The method of claim 40 wherein ^ resoufce 

32. The method of claim 31, comprising storing the ^ m cxccutablc progranij comprisin g ^ thc ^nd 

reS ° Ur ^ USe da L ta ' n * d f abaS f 0 , , . . 20 resource to monitor a number of times the first resource is 

33. The method of claim 28, wherein monitoring user rcqucstcd or uscd , whercin using ^ sccoad rcsourcc com . 
interaction compnses monitoring input device events ^ executi ^ resoufce Qn ^ ^ {Q monitor 

34. The method of claim 33 wherein the input dev 1C e cach ^ ^ first TCSQmcc ^ rctrieved from ^ firs( 
events comprise at least one or keyboard events and mouse or ^ ^ cac Yie 

ev ^ ts ~ , , r . „„ , , .25 49. The method of claim 40 wherein the network com- 

35. The method of claim 28, comprising downloading the rfses a ^ofo. 

first resource as a result of a request initiated by input from ^ ^ method of c , aim ^ wherein ^ ne|W(jrk com _ 

a user of the first client. prises a communications network. 

36. The method of claim 28, compnsing downloading the 51 ]n a netWQ[k ^ Qne Qr more connecUble 
tracking program from a second server of the one or more 3Q to one Qr mQK a method of monitoring interaction 

servers. , . . . . through a first client of the one or more clients with a first 

37 The method of claim 28. comprising monitoring resource b a the fils , fesource ^p^g a phlrali , 

details of choices made by a user of the first client usmg an of componenls and having a resource identifler asso^d 

input device of the first client, the choices beuig associated therewith> the method comprising: 

with at least one of the first resource, at least one of the one 1C A , t „ 

or more components thereof, and the one or more second 35 downloading to the first client a first component of the 

_ first resource from a first server; 

resources. ' 

38. The method of claim 28, comprising storing the executing on the first client an executable program to 
resource us data in the client computer. download one or more second components of the first 

39. The method of claim 28, comprising communicating AQ resource; 

the resource use data from the first client to a server of the monitoring interaction through the first client with the first 

one or more servers. component of the first resource and interaction through 

40. In a network having one or more servers connectable the first client with the one or more second components 
to one or more clients, a method of monitoring interaction of the first resource; 

through a client of the one or more clients with a first 45 generating resource use data; and 

resource which is stored on a first server and which may be transmitting from lhe first cliem to a ^nd wrvtT 

stored in a cache, the method comprising: information comprising resource use data and the 

downloading from one or more source servers one or resource identifier. 

more files containing embedded references to the first 52 . ^ me thod of claim 51 wherein the resource use data 

resource and a second resource; 50 comprises data representative of a plurality of preferences of 

the client requesting or using the first resource one or me user 

more times from the first server or from the cache; 53. The method of claim 51 wherein the resource use data 

downloading the second resource from a second server to comprises data representative of a plurality of interests of 

the client, the first server and second server comprising the user. 

two servers and the second resource not being part of 55 54. The method of claim 51 wherein the resource use data 

the first resource; and comprises data which does not affect the user's contempo- 

using the second resource to monitor interaction through raneous experience of the resource. 

the client with the first resource. 55. The method of claim 51 wherein the step of generating 

41. The method of claim 40, wherein the second resource resource use data does not require the active participation of 
is an applet. 60 a user. 

42. The method of claim 41, wherein the applet is 56. The method of claim 51 wherein the step of generating 
implemented in the Java programming language. resource use data comprises the automatic determination of 

43. The method of claim 40, wherein the step of request- a user's preferences. 

ing or using one or more files comprises requesting from one 57. In a network having one or more servers connectable 

or more source servers a plurality of files each containing 65 to one or more clients, a method of monitoring interaction 

embedded references to the first resource and a second through a first client of the one or more clients with a first 

resource. resource comprising a plurality of components, the first 
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resource having a resource identifier associated therewith, 
the method comprising: 

downloading to the first client a first component of the 

first resource from a first server; 
monitoring interaction through the first client with the first 5 
component of the first resource to generate resource use 
data; and 

transmitting from the first client to a second server the 
resource use data and the resource identifier. JQ 

58. The method of claim 57, comprising downloading the 
executable program from a third server. 

59. In a network having one or more servers connectable 
to one or more clients, a method of monitoring interaction 
through a first client of the one or more clients with a first JS 
resource comprising a plurality of components, the first 
resource having a resource identifier associated therewith, 
the method comprising: 

downloading to the, first client a first component of the 
first resource from a first server; 
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executing on the first client an executable program to 
download one or more second components of the first 
resource; 

monitoring interaction through the first client with the first 
component of the first resource to generate resource use 
data; 

monitoring interaction through the first client with the one 
or more second components to generate resource use 
data; 

executing on the first client one or more of the second 
components to download one or more third compo- 
nents of a second resource; 
monitoring interaction through the first client with the one 
or more third components to generate resource use 
data; and 

transmitting from the first client to a second server the 
resource use data and the resource identifier. 
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iMETHOD AND SYSTEM FOR USING A following a palh from the node 140 representing Sue up 

TWO-TIERED CACHE through and including all of the higher levels in the hierar- 
chy. 

_ „. , The information provided to Sue 140 may include values 

BACKGROUND OF THE INVENTION s for ^ such „ s £ stem wnRg)jntion atlr £ uteSi ioduding 

1. Field of the Invention the colors displayed on her screen and the size of the 
The present invention relates to a computer system, and windows on her display. Sue may choose to override these 

deals more particularly with a method, system, and com- values ( or m y values provided by a higher level) in order to 

puter program product for using a two-tiered cache for customize her display to her own preferences. In doing so, 

storing and accessing hierarchical data, in order to minimize 10 Sue has now crealed her own customized set of values 

memory usage and improve performance. representing a coalesced version of all the higher level 

2. Description of the Related Art values from ^ hi e hest level 100 down to and deluding the 
y 4 * j* * *i_ * j i* 4/ . * tL • values Sue 140 provides. This customized set of information 
In the distributed client/server environment, there is a ■ * • n • j - j . ^ .i. 

,,. • * • * is typically stored in a database or other repository on a 

need to disseminate information within a company or cor- 1C e u*u *• u 

..... • j . c j . 15 server tor access by the user any time he needs to use the 

poration that is required by users for everyday operation. . r t . nt / , , , /j *i_ . j- i 

fL. . e . ii ■ ■ . , » • information. (It should be noted that overriding values 

ihis information typically includes such things as account- j c j . u* u i i r*u i_* l * 

. wiMuij luwuuw cuvii awwum defined at a higher level of the hierarchy may in some cases 

ine information, computer configuration information, etc., u v •» j * u*rn*i_jc j / 

° . • ii m Li 1-- t. De limited to a subset of all the defined values, since some 

and is typically available on one or more servers which are h; hef levd va]ues be mandat } 

accessed by the users. Since companies, particularly large ™ * . . , 
corporations, are often organized hierarchically (such as W1 f n a of ,_ a c } ltnt mach ™ "^"f 5 data that * 
divisions, departments, etc.), each of these hierarchical » ahierarchical structure (such as the example for 
levels may specify information pertinent to that level, where F[C \ 1 >> wha ' the "f" wanUi ln f P on f 15 "coalesced- 
each lower level of hierarchical information augments or sel ° f data. As used herein a coalesced set of data refers to 
overrides the level(s) above it. In particular, this hierarchical „ gathering all the data m the hierarchy, and merging data 
approach to specifying information is often used for com- t °S e 1 lher suc , h *at the value for any given attribute is set by 
puter configuration information. In this scenario, each user 'j 1 * lowes ' level L m h,erarch y which has that attribute 
of the company systems accesses the information provided defbe , d - From , lhe user k s P 01 " 1 °f, view > he ^ceives a 
by the higher levels of the hierarchy, for example to use complete set of data, which is really a composite of the 
when connecting to the company network or when installing 30 ac,ual data from the user level and all levels above it m the 

hierarchical chain 

software on his computer. The user at this point occupies the 

lowest level of the hierarchy, and may customize the infor- complex types of stored data may impact overall 

mation obtained from the hierarchy above him. These cus- performance of the server when information is retrieved 

tomized values are often referred to as "user preferences" since the coalescing of the stored data may involve a 

and may be unique to any user within the hierarchy. 35 combination of several memory accesses as well as disk 

FIG. 1 is an example of this type of hierarchical structure. re * ievals to f^ect, and th ™ coalesce > aU the necessary 
This structure represents hierarchically stored information, information before responding to a user request For 
which typically mirrors the hierarchy of the organization. !* am P le > lf Sue T ^ esis «^™tion for the hierarchy of 
Company X 100 represents information stored for the cor- FIG > ^sk accesses (which may further require 
porate or highest level of the company. Any information 4Q rmmd 'T throUgh a network) may be needed to 
defined at this level typically pertains to all elements of the ' e * ie y e the T formation for Company X 100 Development 
hierarchy throughout the company, from organizations to 105 > Uscr Interfacc 115 > and finallv for Suc 140 
individual users, because corporate settings tend to be broad Wmle onl y a simple hierarchy is shown m FIG. 1, many 
in scope and define boundaries for the subordinate levels to more levels (with many nodes at each level) may exist in an 
follow. In this example, there is a subordinate level in 45 organization or corporation. Performing multiple disk 
Company X 100 which represents the development organi- accesses to retrieve information for each level is a coni- 
zation 105 and the sales organization 110. At this level, each tationally expensive operation. In addition, having to con- 
subordinate organization uses the information from Com- tinually recalculate the coalesced image of the data when 
pany X 100 and may specify additional information to be changes occur is compute intensive. Conversely, caching the 
used by its subordinate levels. Either or both of the devel- 50 com plete set of coalesced data for every user is very storage 
opment and sales organizations may also make changes (i.e. or memory intensive and requires a complete recoalescence 
augmenting or overriding defined values) to the original anv time data represented in the coalescence is changed, 
information from the Company X level. This changed infor- Accordingly, what is needed is a technique that avoids the 
mation is then made available to the subordinates of the level performance penalty of this continual recalculation, and 
at which the change was made. 55 avoids the storage penalty of storing large amounts of 

Moving down the hierarchy to the next level under coalesced data, 

development 105, we have two more organizations or pos- SUMMARY OF THE INVENTION 
sibly groups (such as a department of employees): (1) the 

user interface group 115 and (2) the API group 120. Both of An object of the present invention is to provide a tech- 

these groups will inherit the information provided by the 60 nio . ue whic k avoids continual recalculation for coalescing 

higher levels in this hierarchy. These groups also may add or complex hierarchical data and minimizes the impact on 

modify information to further refine the data from the higher performance and memory consumption in a client/server 

levels of the hierarchy, where these changes are then made environment. 

available to the subordinate levels of the hierarchy. In this Another object of the present invention is to provide this 

example, the subordinates to the user interface group 115 are 65 technique using a two-tiered cache, 

the users (140, 145) of the company resources. Here, for Yet another object of the present invention is to provide 

example, Sue 140 inherits all the information available from this technique in a manner that allows a server to determine 
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if a set of coalesced cached values is out-of-date with the corresponding to the particular user's user node in the 

data store it represents. repository; and populating the particular user's client cache 

Other objects and advantages of the present invention will otherwise. This populating further comprises: retrieving the 

be set forth in part in the description and in the drawings ^ data from the particular user's user node in the reposi- 

which follow and, in part, will be obvious from the descrip- * tory; storing the retrieved user data in the client cache for the 

• ■ % • f . user; and setting the client cache timestamp for the user to 

tion or may be learned by practice of the invention. J \ . . . f. r . v.. , 

J J r the last update Umestamp for the corresponding user node. 

To achieve the foregoing objects, and in accordance with Retrieving the CDI for the group node preferably comprises: 

the purpose of the invention as broadly described herein, the retrieving the CDI from the central data cache when (1) the 

present invention provides a method, system, and computer CDI for the group node exists and (2) the CDI timestamp for 

program product for use in a computing environment for 10 the CDI is not older than the last update timestamp corre- 

using a two-tiered cache for storing and accessing hierar- sponding to the group node in the repository; and creating 

chical data. This technique comprises: providing a hierar- the CDI otherwise. Creating the CDI further comprises: 

chical structure of data comprising a top-level node, one or retrieving the data values from the repository for the group 

more intermediate levels having one or more intermediate- node, the top-level node, and all of the intermediate-level 

level nodes, and one or more user nodes, wherein each of the 15 nodes in the hierarchical path from the group node to the 

user nodes is a child node of the top-level node or one of the top-level node; coalescing the retrieved data values; storing 

intermediate-level nodes, wherein the hierarchical structure the coalesced data values as the CDI for the group node in 

is stored in a data repository accessible in the computing lhe central data cache i and the CDI timestamp for the 

environment and wherein each of the nodes has a corre- CDI t0 ^ c Jj?* u P datc timestamp for the corresponding 

sponding last update timestamp stored in the repository, the 20 node * ™* may further comprise repeating 

, t j * * ■ * r iL j j * u * i i this process for each of the group nodes above this group 

last update timestamp for the user nodes and the top-level , *\ . . * A * * 

, r A . i . j . . j4 i node in the hierarchical path until reaching a first of the 

node representing a last update to one or more data values , - ..... Ww.- . r , L f . L ' 

r tL j j 7i_ i . j . *• * <■ *u ■ * group nodes for which the CDI timestamp for the CDI of this 

of the node and the last update timestamp lor the interme- ? , , • , , . t . , « . A 
j. 4 . r j » c j . i r *u first group node is not older than the last update timestamp 
diate nodes representing an update or data values of the «■ . j ■ 
intermediate node or a parent of the intermediate node; 25 ^responding to the group node in the repository 
creating coalesced data images (GDIs) for each of the In one as P ect » the retrieving of user data from the client 
top-level or intermediate-level nodes which is a group node, cache ^ermines whether the client cache Umestamp for the 
wherein a particular node is one of the group nodes when the Particular user's client cache is different from the last update 
particular node has one or more of the user nodes as a child, timestamp corresponding to the particular user s user node 
and wherein the CDI for the particular node comprises a 30 m the repository rather than whether the client cache times- 
coalescence of data values for the particular node, the tam P 15 not th \ last ^ d f! Umest * m P> and . the 
top-level node, and all of the intermediate-level nodes in a retrieving of the CDI from the central data cache determines 
hierarchical path from the particular node to the top-level the CD1 timestamp for the CDI is different from the 
node; storing the created CDIs in a central data cache along last u P date umestamp corresponding to the group node in 
with a CDI timestamp for each of the stored CDIs wherein 35 ^ repository rather than whether the CDI timestamp is not 
the CDI timestamp for each of the CDIs is set to the older than lhe last u P date timestamp. 
corresponding last update timestamp for the corresponding Retrieving an unrefreshed result preferably further corn- 
node; and storing user data for each of one or more users in P rises: retrieving the user data for the particular user; retriev- 
a client cache for the user along with a client cache ing the CDI for the group node of which the user node is one 
timestamp, wherein: each of the users is associated with a 40 of tQe children; and merging the retrieved user data with a 
selected one of the user nodes; the client cache timestamp is Parent CDI associated with a parent node of the user node, 
set to the corresponding last update timestamp for the Retrieving the user data further comprises: retrieving the 
corresponding user node; and the stored user data is uncoa- user data from the client cache when the client cache for the 
lesced. particular user exists; and populating the particular user's 

- . , 45 client cache otherwise. This populating further comprises: 

This technique may further comprise updating one or ... . i . « , » . 

£ ,i j . / c i * j j u t u- re trie vine the user data from the particular user s user node 

more of the data values for a selected node, where this . 4 . to . t , . . ■ , ■> . . . 

, r, . . . j * * it. a 4 m the repository; storing the retrieved user data in the client 

updating further comprises: applying an update to the data , . Jl . v t , . c 

values in the repository; updating the last update timestamp <! lch6 for , th^r; and setUng the chent cache Umestamp for 

correspond^ to the selected node; determining whether the <n the use < 10 last Up T™t T corres P° nd ! n e 

, 4 . j • ftL j j 50 user node. Retrieving the CDI further comprises: retrieving 

selected node is one of the group nodes; and propagating the 4 . . iU * ij* u u >u r*™ c .u 

updated timestamp to each of the group nod««tordinate to the CDI ^ ^ ce ° tral d , ala f h ch i^ he " "».CDI for lhe 

the selected group node in the hierarchical structure when T^n, 0 ^ ; S , „ h T^i f 8 

4 , , . • r u i. trie CDI further comprises: retrieving the data values from 

the determination has a positive result. . . . . * , . ° . , , , 

. . . . the repository for the group node, the top-level node, and all 

Preferably, this technique further comprises retrieving a 55 0 f the intermediate-level nodes in the hierarchical path from 

coalesced result in response to a request from a particular the group node tQ the top . level node; coalescing the 

one of the users, wherein the request may specify a refreshed retrieved data values; storing the coalesced data values as 

result or an unrefreshed result. (he CDI for (he grmip node ifl me central data cache; and 

Retrieving a refreshed result preferably further comprises: setting the CDI timestamp for the CDI to the last update 

retrieving the user data for the particular user; retrieving the 60 timestamp for the corresponding group node. 

CDI for the group node of which the user node is one of the preseD t invention will now be described with refer- 

children; and merging the retrieved user data with the CDI ence to ^ following drawings, in which like reference 

of the parent node. Retrieving the user data preferably numbers denote the same element throughout, 
further comprises: retrieving the user data from the client 

cache when (1) the client cache for the particular user exists 65 BRIEF DESCRIPTION OF THE DRAWINGS 

and (2) the client cache timestamp for the particular user's FIG. 1 is a block diagram example representing a hier- 

client cache is not older than the last update timestamp archically structured organization and a hierarchical struc- 
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ture for storing data associated with the organization in Enterprise Systems Architecture/370 available from the 

which the present invention may be practiced; International Business Machines Corporation ("IBM"), an 

FIG. 2 is a block diagram of a computer workstation Enterprise Systems Architecture/390 computer, etc. 

environment in which the present invention may be prac- Depending on the application, a midrange computer, such as 

t j cec j. S an Application System/400 (also known as an AS/400) may 

_ . . be employed. ("Enterprise Systems Architecture/370" is a 

FIG. 3 ls a diagram of a networked computing environ- tradem ^ k ' of ffiM; «& terpri se Systems Architecture/390", 

ment in which the present invention may be practiced; "Application SystemMOO", and "AS/400" are registered 

FIG. 4 depicts the hierarchy of FIG. 1, showing how a trademarks of IBM.) 

two-tiered cache is used with this hierarchy, according to the ^ ^ gateway computer 346 may also be coupled 349 to a 

present invention; and storage device ( such ^ data repos j tory 348). Further, the 

FIGS. 5-7 provide flow charts illustrating the preferred gateway 346 may be directly or indirectly coupled to one or 

embodiment of the logic used to implement the present more workstations 210. 

invention. Those skilled in the art will appreciate that the gateway 

DESCRIPTION OF THE PREFERRED 15 com P uter 346 may be located a great geographic distance 

EMBODIMENT ^ rom me networ ^ 3^2, and similarly, the workstations 210 

may be located a substantial distance from the networks 342 
FIG. 2 illustrates a representative workstation hardware an d 344. For example, the network 342 may be located in 
environment in which the present invention may be prac- California, while the gateway 346 may be located in Texas, 
ticed. The environment of FIG. 2 comprises a representative M anc j oae or more 0 f tne workstations 210 may be located in 
single user computer workstation 210, such as a personal N ew York. The workstations 210 may connect to the wire- 
computer, including related peripheral devices. The work- j css ne twork 342 using a networking protocol such as the 
station 210 includes a microprocessor 212 and a bus 214 Transmission Control Protocol/Internet Protocol ("TCP/IP") 
employed to connect and enable communication between over a n Um b e r of alternative connection media, such as 
the microprocessor 212 and the components of the work- ^ cellular phone, radio frequency networks, satellite networks, 
station 210 in accordance with known techniques. The etc . The wireless network 342 preferably connects to the 
workstation 210 typically includes a user interface adapter gateway 346 using a network connection 350c such as TCP 
216, which connects the microprocessor 212 via the bus 214 0 r UDP (User Datagram Protocol) over IP, X.25, Frame 
to one or more interface devices, such as a keyboard 218, Re i ay> ISDN (Integrated Services Digital Network), PSTN 
mouse 220, and/or other interface devices 222, which can be 3Q (Public Switched Telephone Network), etc. The worksta- 
any user interface device, such as a touch sensitive screen, tj ons 210 may alternatively connect directly to the gateway 
digitized entry pad, etc. The bus 214 also connects a display 345 usm g dial connections 350b or 350c. Further, the 
device 224, such as an LCD screen or monitor, to the wireless network 342 and network 344 may connect to one 
microprocessor 212 via a display adapter 226. The bus 214 or more other networks (not shown), in an analogous manner 
also connects the microprocessor 212 to memory 228 and 35 t 0 that depicted in FIG. 3. 

long-term storage 230 which can include a hard drive, Software programming code which embodies the present 

diskette drive, tape drive, etc. invention is typically accessed by the microprocessor 212 of 

The workstation 210 may communicate with other com- the workstation 210 and server 347 from long-term storage 

puters or networks of computers, for example via a com- media 230 of some type, such as a CD-ROM drive or hard 

munications channel or modem 232. Alternatively, the work- 40 drive. The software programming code may be embodied on 

station 210 may communicate using a wireless interface at any of a variety of known media for use with a data 

232, such as a CDPD (cellular digital packet data) card. The processing system, such as a diskette, hard drive, or 

workstation 210 may be associated with such other com- CD-ROM. The code may be distributed on such media, or 

puters in a local area network (LAN) or a wide area network may be distributed to users from the memory or storage of 

(WAN), or the workstation 210 can be a client in a client/ 45 one computer system over a network of some type to other 

server arrangement with another computer, etc. All of these computer systems for use by users of such other systems, 

configurations, as well as the appropriate communications Alternatively, the programming code may be embodied in 

hardware and software, are known in the art. the memory 228, and accessed by the microprocessor 212 

FIG. 3 illustrates a data processing network 340 in which using the bus 214. The techniques and methods for embody- 

the present invention may be practiced. The data processing 50 ing software programming code in memory, on physical 

network 340 may include a plurality of individual networks, media, and/or distributing software code via networks are 

such as wireless network 342 and network 344, each of well known and will not be further discussed herein, 

which may include a plurality of individual workstations A user of the present invention at a client computer may 

210. Additionally, as those skilled in the art will appreciate, connect his computer to a server using a wireline 

one or more LANs may be included (not shown), where a 55 connection, or a wireless connection. Wireline connections 

LAN may comprise a plurality of intelligent workstations are those that use physical media such as cables and tele- 

coupled to a host processor. phone lines, whereas wireless connections use media such as 

Still referring to FIG. 3, the networks 342 and 344 may satellite links, radio frequency waves, and infrared waves, 

also include mainframe computers or servers, such as a Many connection techniques can be used with these various 

gateway computer 346 or application server 347 (which may 60 media, such as: using the computer's modem to establish a 

access a data repository 348). A gateway computer 346 connection over a telephone line; using a LAN card such as 

serves as a point of entry into each network 344. The Token Ring or Ethernet; using a cellular modem to establish 

gateway 346 may be preferably coupled to another network a wireless connection; etc. The user's computer may be any 

342 by means of a communications link 350a. The gateway type of computer processor, including laptop, handheld or 

346 may also be directly coupled to one or more worksta- 65 mobile computers; vehicle-mounted devices; desktop com- 

tions 210 using a communications link 350/?, 350c. The puters; mainframe computers; etc., having processing and 

gateway computer 346 may be implemented utilizing an communication capabilities. The remote server, similarly, 
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can be one of any number of different types of computer represented by the hierarchy, the CDI of the user's imme- 

which have processing and communication capabilities. diate parent (i.e. the next higher level of the hierarchy) is 

These techniques are well known in the art, and the hard- retrieved from the central cache and is coalesced with the 

ware devices and software which enable their use are readily uncoalesced image in the client cache. The result is a single, 

available. Hereinafter, the user's computer will be referred 5 coalesced image comprised of all the values provided by the 

to equivalently as a "workstation", "device", or "computer", hierarchy as well as the unique values specified by the user 

and use of any of these terms or the term "server" refers to which ove rride or augment values inherited from the hier- 

any of the types of computing devices described above. arcfly , n this majmcr> ^ voladUty of ^ cachc [& 

In the preferred embodiment, the present invention is significantly reduced since each CDI does not include the 

implemented as one or more modules (also referred to as 10 morc volatilc clicnt information The of mcmory is 

code subroutines, or "objects" in object-oriented sigiuficantly reduced since multiple users (such as Sue 140 

programming) of a computer software program (or md Bob 145) ^ Q ^ m the same CDI By caching a 

programs) which provides a two-tiered caching technique to coa lesced image representing all levels of a hierarchy above 

avoid performance impacts and to avoid stormg large a particular user (such as CDI 400 for the Development level 

amounts of data in memory. The invention may be used in 15 10s above ^ Bob 135 ) t or multiple users (such as CDI 405 

the Internet environment or in a corporate intranet, extranet, for ^ User i nter f a ce level 115 above user Sue 140 and Tim 

or any other network environment. The program code of the 14S ) ^ mea applying a user's uncoalesced data to the 

preferred embodiment may be implemented as objects in an appropriate CDI, the user obtains a full set of information 

object-oriented programming language, or in a conventional only two storage rctricvals ( onc for tne C DI, and one 

procedurally-onented language, or m a mix of object- 20 for the m&T > s uncoa iesced data)— assuming the CDI is 

oriented and procedural language code. In the preferred up-to-date 

embodiment, the code of the present invention operates on Kr . t ... , . ... , , . . ,. t , 

. . i # . , - , . Note that while the hierarchy discussed herein is directed 

the server. Alternatively, a portion of the code may operate A , c ... - . , 

A , , u- / * -r iL i • r toward configuration data in an organization, this is merely 

on the client machine (e.g. to perform the coalescing of a .,. . r „ . 1Jt . * , . ,/ 

r . , K , to , . l . , - ... , , \ illustrative. Atypical data store may contain hierarchically 

user s preferences with data inherited from higher levels). . . , . r Jr t . ■ , . c } 

r , . . 25 structured information about various types of resources. 

A server machine m the client/server environment typi- (Examples of a resource could be a uscr , groupj macnine , 

cally maintains frequently accessed information m a system ^ From ^ ^ int of vie% these fesources are 

memory cache for improved performance. For complex data , ica]1 iQ a hierarchical group structure . So> in the 

structures such as a hierarchy of data as has been discussed, d ^ a fesource m[ M be a member of a and 

the cached image may be comprised of the coalesced data 30 ^ mi ht be a member of fl hi her ^ etc 

from all levels of the hierarchy. The problem with main- _ * . . 

taining a cached image for a full hierarchy of data is that the ^ P' eferred "nbodiment of ^ present invention will 

image must be recalculated (i.e. re-coalesced) each time a now bc discussed wth rc 8 ard to FIGS * 4 " 7 * 

change is made to the information which is represented by F IG - 4 shows how the two-tiered cache of the present 

the cached image, in order to incorporate the changes into 35 invention is used with hierarchically stored data such as the 

the image. With reference to the example hierarchy of FIG. example depicted in FIG. 1. As discussed earlier, a coalesced 

1, a coalescence of the full hierarchy (according to the prior data i^ge °f hierarchical data is kept for all levels of the 

art) would exist for each user, to include unique information hierarchy which have users beneath them. Thus, in FIG. 4, 

for this user. The coalescing of data for each user may result a coalesced data image 400, 405, is kept in the central cache 

in multiple cached images that are essentially the same, 40 for the "Development" 105 and "User Interface" 115 nodes 

except for the individual user's unique values. In addition, because these groups have users Bob 135, and Sue 140 and 

coalesced images may be stored for each intermediate level Tim 145, respectively. A client cache is kept for each user 

(representing the hierarchical path to the top of the (Bob 135, Sue 140, Tim 145), and this client cache provides 

structure). System performance is affected by all the uncoalesced data (410, 415, 420, respectively) to override 

required frequent coalescing of data and the memory utili- 45 tne data stored in the CDI for the user's immediate parent, 

zation is affected since multiple versions of the hierarchical The omer Qodes of tne example hierarchy have no users 

data are stored in the system memory cache. depicted, and thus none of these nodes has an associated 

According to the preferred embodiment of the present CDL ^ coalesced data image 400 for the Development 

invention, this problem is solved by keeping two levels of S rou P 105 IS comprised of all information from the Devel- 

cache: (1) a central cache of coalesced images, and (2) a 50 °P ment § rou P ^ aU levels above il ( U ' the Com P an y 100 )* 

client cache for uncoalesced client images. In the preferred ^ coalesced data image 405 for the User Interface group 

embodiment, these two caches exist in the same physical 115 is comprised of all the information from the User 

memory cache on the server machine and are logically Interface group 115, the Development group 105, and the 

separated. Optionally, these two caches may be physically Company 100. When user Sue 140, for example, requests 

separate memory caches on the same server machine or may 55 lhe data from the hierarchy, the CDI 405 from her 

be located on physically separate machines (including stor- immediate parent (User Interface 115) is coalesced with the 

ing a client's uncoalesced data locally on the client ^coalesced data 415 from Sue's client cache and returned 

machine). 10 S * e 140 

According to the present invention, a coalesced data The coalesced data in a CDI typically changes infre- 

image (CDI) of hierarchical data is kept for all levels of the 60 quently since it contains data provided at higher, less volatile 

hierarchy which have users (or more generally, terminating levels of a hierarchy. That is, it can be expected that a 

resources) beneath them. Each user who provides custom- department changes its information less frequently than does 

ized data to override the inherited data in a CDI has his a user, mat a clivision changes its information less frequently 

unique changes saved in a client cache (where the changes m an a department, etc. 

are stored uncoalesced). (A user's unique changes are 65 According to the present invention, CDIs stored in the 

referred to herein as the user's "uncoalesced data".) When central cache are read-only in that changes are never made 

the user requests the coalesced data for all the information by users directly in the data represented by a CDI. At some 
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point in time, an administrator may have a need to change 600) issues a request for the coalesced preferences (as 

some or all of the data in the hierarchy. (Administrators before), but now requests a refresh of any out-of-date cached 

typically have authority to make changes to data at various values. Block 605 determines if the client cache for this user 

levels of the hierarchy, and do not have the same limitations is up-to-date. A client cache for the requesting user may not 

as other users.) Should an administrator change higher level s exist when the request for a refresh is received. In that case, 

data (i.e. non-user-level data such as corporate-level data, a clieDl cache fe built which » b Y default, is refreshed, 

division-level data, etc.) that is stored on the server, the CDI Otherwise, a comparison is made between the timestamp of 

will be out-of-date with the stored data. By requesting the th * user ' s ^ ent ^ ache ^ { } c tunesUmp indicating the last 

hierarchical data with the refresh option (discussed in detail u P da f e 10 ^ ^°red data in the data repository. If the 

below), the present invention will cause the server to 10 'ache for this user is . not up -to-date (ix the 

ii i j , ■ * , • » - client cache timestamp is older than the timestamp of the last 

re-coalesce all to data into a new coalesced data image in date tQ ^ user >/ stored data) then at Blo£ £ 610j any 

the central cache. Note that multiple new CDIs may be e)dstin client cache data fc ^dtd. The user's chent 

created durmg a refresh, when the changed data is repre- cache ^ then populated ^ currentj uncoa i eS ced data, and 

sented in more than one CDI. For example, information mc umcstamp indicating the last time the user's stored data 

denned at Company level 100 is reflected in CDI 400 as well 15 was updated is saved in the client cache. If the user's client 

as CDI 405. Similarly, information defined at Development cacne an d i s not out-of-date, processing continues to 

level 105 is reflected in both of these CDIs. Thus, if an Block 615. 

administrator changes Company 100 data or Development Block 615 determines if the CDI for the parent group 

105 data, both CDIs will become out-of-date and will be exists in the central cache and is up-to-date. As with the 

re-coalesced on a refresh issued from a level beneath User 20 client cache, if the CDI for the parent group does not exist, 

Interface 115. then a new parent CDI is created in the central cache at 

The logic with which the preferred embodiment of the Block 620 which, by default, is refreshed. Or, if the CDI for 

present invention is implemented will now be discussed with thc P arcnt &°*V * not up-to-date (i.e. the CDI timestamp for 

reference to FIGS 5-7 me P arenl g rcm P is older than the timestamp of the last 

m . " „ , .„ . . , . • . 25 update at this level in the repository), then Block 620 

FIG. 5 prov,des a flow chart illustrating the logic with discafds exjstin CDI for , he m &nd hers 

which a user requests coalesced data from a server without all , he ^ for ^ higher .i 6ve i groups ^ me hierarchy from 

requesting a refresh. Id this example, the user Joe at Block , he it inc|udi theparen t group . The collected data 

500 issues a request to a server for data that is coalesced fa coalesced into , (new or refresh ed) CDI, and the central 

from complex hierarchical data. The desired coalesced data Mche js populated with this CDI . Block 62s saves i he 

in this example is comprised of all the data stored in the , imestamp associated wilh this C DI in thc central cache, 

hierarchy including any data previously specified by Joe where ^ CD[ timesl is lh( . lime of thf . lasl u ^ ate , 0 

(e g. additional values or overridden values). At Block 505, daU for ^ Ieve] Qr for Qne of ^ hi ^ er . levcl 0 

a determination is made as to whether a client cache exists coalesced int0 this CDIi whicheV er is later, in the repository, 

for this user Joe. If there is no chent cache (a No at Block Optionally, this checking of the parent group CDI may be 

505), then at Block 510 the client's unco alesced data is ted for each Qext . higher . level grOTjp CDI in ^ paui t0 

retrieved from thedatastore(i.e.diskfile,database,etc.)and the t level> UQtil rMchi a CDI which is up . t0 ^ ate . 

used to populate a client cache entry for this user. A „, , ,, A . , , c ., ... u , . 

r r ,. ... ...... , Block 630 is reached following a positive result at Block 

Umestamp representing the last update to to user s uncoa- fil5 Qr letion of Block 625 . At Block 630, the CDI of 

lesced data in the repository is also stored-in the user s client , r , , ... , , 

u ir * 1 fn* i- . . n « . , . 40 the user s parent group is coalesced with the uncoalesced 

cache. If, at Block 505, a chent cache for this user does exist , 4 . r , *%. , , , . 4 , . 

/ mr » i.\ • * tii i ffie data in the user s client cache, and the result is returned to 

(a Yes result), then processing continues to Block 515. ^ e 

At Block 515, a test is made to determine if a CDI for the "pjJJ^ ^ a flowchart SSagUlj6ng me logic ^ 

parent (..e the next higher level above the requesting user) whjch ^ f erred einbodimem processe s a user request to 

exists in the central cache. If not then processing continues 45 dat6 da(a stored in ^ data ito At Bbck 700> thc 

to Block 520 where all the data for the h>gher-level groups ^ a ( to date ^ which fa stored in a 

in the hierarchy is gathered^ from the current user s parent r6pository . At Block 705> a determination is made as to 

level to the highest level. The collected information is then whether me da(a feei da(ed existfi in ^ cUent ^ fof 

coalesced and stored as a CDIin the central cache. A thjs ^ , f not> tten at Block 710 the client cache is 

Umestamp associated with this CDI is saved in the central 50 laled with lhe existing uncoalesced data and the times- 

cache at Block 525, where the value of this timestamp is set , of the 

last update to the user's stored data is saved in 

to the last update time of the data at this level in the the cli ent cache, tenser's specified updates are made to the 

repository. stored data as well ^ t0 tne client cache at Block 715 At 

If a CDI for the parent group exists (a "Yes" result at Block 720, the timestamp of the client's data in the reposi- 

Block 515), or after creating a CDI at Blocks 520 and 525, 55 tory as well as the client cache umestamp are updated to 

processing continues at Block 530. At Block 530, the user's re fl ect the time of the current update. Block 725 determines 

uncoalesced data in the client cache is coalesced with the if the updated information is group -level data. If so, then an 

CDI for the user's parent, and the result is returned to the updated timestamp is propagated downward (Block 730) to 

user. Note that the logic of FIG. 5 made no determination as all subordinate groups of the hierarchy below this updated 

to whether values in an existing CDI were out-of-date with 60 groupt That is, the last update time for the subordinate 

the corresponding data stored in the data repository since the groups in the repository is set to the time of the current 

user's request (Block 500) did not request a refresh of any update. By changing the timestamps of the stored data for 

previously cached values. the group and its subordinate groups, a refresh of any CDIs 

FIG. 6 provides a flow chart illustrating the preferred for these subordinate data levels will occur with the nekt 

embodiment of the logic for processing a user's request for 65 user request that specifies a refresh. Block 735 returns an 

coalesced data which will be refreshed (if the appropriate indication of success or failure of the update operation to the 

CDI and/or client cache is out-of-date). The user (at Block user. 
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In an optional enhancement of the preferred embodiment, 
a random ly-gene rated sequence number may be used as the 
"timestamp" values which have been discussed herein, 
instead of a conventional time value. This enhancement will 
be advantageous when more than one server application may 
be updating data values in the repository (and thus updating 
the associated timestamps), and enables avoiding clock 
synchronization discrepancies among the different applica- 
tions. When this enhancement is used, the comparisons 
between timestamp values to determine if a stored CDI or 
client cache is up-to-date is replaced by a comparison to 
determine whether the timestamp values are different; if the 
values are different, then the stored data has been changed 
and thus is not up-to-date. 

As has been demonstrated, the present invention provides 15 
advantageous techniques for storing and retrieving hierar- 
chically structured data by creating a two-tiered cache. The 
technique defined herein for creating and accessing this 
two-tiered cache avoids the performance penalty and storage 
penalty of prior art solutions. 

While the preferred embodiment of the present invention 
has been described, additional variations and modifications 
in that embodiment may occur to those skilled in the art once 
they learn of the basic inventive concepts. Therefore, it is 
intended that the appended claims shall be construed to 
include both the preferred embodiment and all such varia- 
tions and modifications as fall within the spirit and scope of 
the invention. 

We claim: 

1. A computer program product embodied on a computer- 
readable medium in a computing environment, for using a 
two-tiered cache for storing and accessing hierarchical data, 
comprising: 

a hierarchical structure of data comprising a top-level 
node, one or more intermediate levels having one or 
more intermediate-level nodes, and one or more user 
nodes, wherein each of said user nodes is a child node 
of said top-level node or one of said intermediate-level 
nodes, wherein said hierarchical structure is stored in a 
data repository accessible in said computing environ- 
ment and wherein each of said nodes has a correspond- 
ing last update timestamp stored in said repository, said 
last update timestamp for said user nodes and said 
top-level node representing a last update to one or more 
data values of said node and said last update timestamp 
for said intermediate nodes representing an update of 
data values of said intermediate node or a parent of said 
intermediate node; 

computer-readable program code means for creating coa- 
lesced data images (CDls) for each of said top-level or 
intermediate-level nodes which is a group node, 
wherein a particular node is one of said group nodes 
when said particular node has one or more of said user 
nodes as a child, and wherein said CDI for said 
particular node comprises a coalescence of data values 
for said particular node, said top-level node, and all of 
said intermediate-level nodes in a hierarchical path 
from said particular node to said top-level node; 

computer-readable program code means for storing said 
created CDIs in a central data cache along with a CDI 
timestamp for each of said stored CDIs wherein said 
CDI timestamp for each of said CDIs is set to said 
corresponding last update timestamp for said corre- 
sponding node; and 

computer-readable program code means for storing user 
data for each of one or more users in a client cache for 
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said user along with a client cache timestamp, wherein: 
each of said users is associated with a selected one of 
said user nodes; said client cache timestamp is set to 
said corresponding last update timestamp for said cor- 
responding user node; and said stored user data is 
uncoalesced. 

2. The computer program product according to claim 1, 
further comprising: 

computer readable program code means for updating one 
or more of said data values for a selected node, further 
comprising: 

computer readable program code means for applying an 
update to said data values in said repository; 

computer readable program code means for updating 
said last update timestamp corresponding to said 
selected node; 

computer readable program code means for determin- 
ing whether said selected node is one of said group 
nodes; and 

computer readable program code means for propagat- 
ing said updated timestamp to each of said group 
nodes subordinate to said selected group node in said 
hierarchical structure when said computer readable 
program code means for determining has a positive 
result. 

3. The computer program product according to claim 2, 
further comprising: 

computer readable program code means for retrieving a 
coalesced result in response to a request from a par- 
ticular one of said users, wherein said request may 
specify a refreshed result or an unrefreshed result. 

4. The computer program product according to claim 3, 
wherein said computer readable program code means for 
retrieving a refreshed result further comprises: 

computer readable program code means for retrieving 
said user data for said particular user, further compris- 
ing: 

computer readable program code means for retrieving 
said user data from said client cache when (1) said 
client cache for said particular user exists and (2) 
said client cache timestamp for said particular user's 
client cache is not older than said last update times- 
tamp corresponding to said particular user's user 
node in said repository; and 
computer readable program code means for populating 
said particular user's client cache otherwise, wherein 
said populating further comprises: 
computer readable program code means for retriev- 
ing said user data from said particular user's user 
node in said repository; 
computer readable program code means for storing 
said retrieved user data in said client cache for said 
user; and 

computer readable program code means for setting 
said client cache timestamp for said user to said 
last update timestamp for said corresponding user 
node; and 

computer readable program code means for retrieving 
said CDI for said group node of which said user node 
is one of said children, further comprising: 
computer readable program code means for retrieving 
said CDI from said central data cache when (1) said 
CDI for said group node exists and (2) said CDI 
timestamp for said CDI is not older than said last 
update timestamp corresponding to said group node 
in said repository; and 
computer readable program code means for creating 
said CDI otherwise, wherein said creating further 
comprises: 
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computer readable program code means for retriev- 
ing said data values from said repository for said 
group node, said top-level node, and all of said 
intermediate -level nodes in said hierarchical path 
from said group node to said top-level node; 

computer readable program code means for coalesc- 
ing said retrieved data values; 

computer readable program code means for storing 
said coalesced data values as said CDI for said 
group node in said central data cache; and 

computer readable program code means for setting 
said CDI timestamp for said CDI to said last 
update timestamp for said corresponding group 
node; and 

computer readable program code means for merging said 15 
retrieved user data with said retrieved CDI. 

5. The computer program product according to claim 3, 
wherein said computer readable program code means for 
retrieving an unrefreshed result further comprises: 

computer readable program code means for retrieving 2 rj 
said user data for said particular user, further compris- 
ing: 

computer readable program code means for retrieving 
said user data from said client cache when said client 
cache for said particular user exists; and 
computer readable program code means for populating 
said particular user's client cache otherwise, wherein 
said populating further comprises: 
computer readable program code means for retriev- 
ing said user data from said particular user's user 30 
node in said repository; 
computer readable program code means for storing 
said retrieved user data in said client cache for said 
user; and 

computer readable program code means for setting 35 
said client cache timestamp for said user to said 
last update timestamp for said corresponding user 
node; 

computer readable program code means for retrieving 
said CDI for said group node of which said user node 40 
is one of said children, further comprising: 
computer readable program code means for retrieving 

said CDI from said central data cache when said CDI 

for said group node exists; and 
computer readable program code means for creating 45 

said CDI otherwise, wherein said creating further 

comprises: 

computer readable program code means for retriev- 
ing said data values from said repository for said 
group node, said top-level node, and all of said 50 
intermediate-level nodes in said hierarchical path 
from said group node to said top-level node; 

computer readable program code means for coalesc- 
ing said retrieved data values; 

computer readable program code means for storing 55 
said coalesced data values as said CDI for said 
group node in said central data cache; and 

computer readable program code means for setting 
said CDI timestamp for said CDI to said last 
update timestamp for said corresponding group 60 
node; and 

computer readable program code means for merging said 
retrieved user data with a parent CDI associated with a 
parent node of said user node. 

6. The computer program product according to claim 4, 65 
wherein said computer readable program code means for 
creating further comprises: 



computer readable program code means for repeating said 
creating for each of said group nodes above said group 
node in said hierarchical path until reaching a first of 
said group nodes for which said CDI timestamp for said 
CDI of said first group node is not older than said last 
update timestamp corresponding to said group node in 
said repository. 

7. The computer program product according to claim 4, 
wherein: 

said computer readable program code means for retriev- 
ing said user data from said client cache determines 
whether said client cache timestamp for said particular 
user's client cache is different from said last update 
timestamp corresponding to said particular user's user 
node in said repository rather than whether said client 
cache timestamp is not older than said last update 
timestamp; and 

said computer readable program code means for retriev- 
ing said CDI from said central data cache determines 
whether said CDI timestamp for said CDI is different 
from said last update timestamp corresponding to said 
group node in said repository rather than whether said 
CDI timestamp is not older than said last update 
timestamp. 

8. A system for using a two-tiered cache for storing and 
accessing hierarchical data, comprising: 

a hierarchical structure of data comprising a top-level 
node, one or more intermediate levels having one or 
more intermediate-level nodes, and one or more user 
nodes, wherein each of said user nodes is a child node 
of said top-level node or one of said intermediate-level 
nodes, wherein said hierarchical structure is stored in a 
data repository accessible in said computing environ- 
ment and wherein each of said nodes has a correspond- 
ing last update timestamp stored in said repository, said 
last update timestamp for said user nodes and said 
top-level node representing a last update to one or more 
data values of said node and said last update timestamp 
for said intermediate nodes representing an update of 
data values of said intermediate node or a parent of said 
intermediate node; 

means for creating coalesced data images (CDIs) for each 
of said top-level or intermediate-level nodes which is a 
group node, wherein a particular node is one of said 
group nodes when said particular node has one or more 
of said user nodes as a child, and wherein said CDI for 
said particular node comprises a coalescence of data 
values for said particular node, said top-level node, and 
all of said intermediate -level nodes in a hierarchical 
path from said particular node to said top-level node; 

means for storing said created CDIs in a central data cache 
along with a CDI timestamp for each of said stored 
CDIs wherein said CDI timestamp for each of said 
CDIs is set to said corresponding last update timestamp 
for said corresponding node; and 

means for storing user data for each of one or more users 
in a client cache for said user along with a client cache 
timestamp, wherein: each of said users is associated 
with a selected one of said user nodes; said client cache 
timestamp is set to said corresponding last update 
timestamp for said corresponding user node; and said 
stored user data is uncoalesced. 

9. The system according to claim 8, further comprising: 
means for updating one or more of said data values for a 

selected node, further comprising: 
means for applying an update to said data values in said 
repository; 
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means for updating said last update limestamp corre- 
sponding to said selected node; 

means for determining whether said selected node is 
one of said group nodes; and 

means for propagating said updated limestamp to each 
of said group nodes subordinate to said selected 
group node in said hierarchical structure when said 
means for determining has a positive result. 

10. The system according to claim 9, further comprising: 
means for retrieving a coalesced result in response to a 

request from a particular one of said users, wherein said 
request may specify a refreshed result or an unrefreshed 
result. 

11. The system according to claim 10, wherein said means 
for retrieving a refreshed result further comprises: 

means for retrieving said user data for said particular user, 
further comprising: 

means for retrieving said user data from said client 
cache when (1) said client cache for said particular 
user exists and (2) said client cache timestamp for 
said particular user's client cache is not older than 
said last update timestamp corresponding to said 
particular user's user node in said repository; and 

means for populating said particular user's client cache 
otherwise, wherein said populating further com- 
prises: 

means for retrieving said user data from said par- 
ticular user's user node in said repository; 
means for storing said retrieved user data in said 

client cache for said user; and 
means for setting said client cache timestamp for 
said user to said last update timestamp for said 
corresponding user node; 
means for retrieving said CDI for said group node of 
which said user node is one of said children, further 
comprising: 

means for retrieving said CDI from said central data 
cache when (1) said CDI for said group node exists 
and (2) said CDI timestamp for said CDI is not older 
than said last update timestamp corresponding to 40 
said group node in said repository; and 

means for creating said CDI otherwise, wherein said 
creating further comprises: 

means for retrieving said data values from said 
repository for said group node, said top-level 45 
node, and all of said intermediate-level nodes in 
said hierarchical path from said group node to said 
top-level node; 

means for coalescing said retrieved data values; 

means for storing said coalesced data values as said 50 
CDI for said group node in said central data cache; 
and 

means for setting said CDI timestamp for said CDI 
to said last update timestamp for said correspond- 
ing group node; and 55 
means for merging said retrieved user data with said CDI 
of said parent node. 

12. The system according to claim 10, wherein said means 
for retrieving an unrefreshed result further comprises: 

means for retrieving said user data for said particular user, 60 
further comprising: 

means for retrieving said user data from said client 
cache when said client cache for said particular user 
exists; and 

means for populating said particular user's client cache 65 
otherwise, wherein said populating further com- 
prises: 
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means for retrieving said user data from said par- 
ticular user's user node in said repository; 
means for storing said retrieved user data in said 

client cache for said user; and 
means for setting said client cache timestamp for 
said user to said last update timestamp for said 
corresponding user node; 
means for retrieving said CDI for said group node of 
which said user node is one of said children, further 
comprising: 

means for retrieving said CDI from said central data 
cache when said CDI for said group node exists; and 

means for creating said CDI otherwise, wherein said 
creating further comprises: 

means for retrieving said data values from said 
repository for said group node, said top-level 
node, and all of said intermediate-level nodes in 
said hierarchical path from said group node to said 
top-level node; 

means for coalescing said retrieved data values; 

means for storing said coalesced data values as said 
CDI for said group node in said central data cache; 
and 

means for setting said CDI timestamp for said CDI 
to said last update timestamp for said correspond- 
ing group node; and 
means for merging said retrieved user data with a parent 
CDI associated with a parent node of said user node. 

13. The system according to claim 11, wherein said means 
for creating further comprises: 

means for repeating said creating for each of said group 
nodes above said group node in said hierarchical path 
until reaching a first of said group nodes for which said 
CDI timestamp for said CDI of said first group node is 
not older than said last update timestamp corresponding 
to said group node in said repository. 

14. The system according to claim 11, wherein: 

said means for retrieving said user data from said client 
cache determines whether said client cache timestamp 
for said particular user's client cache is different from 
said last update timestamp corresponding to said par- 
ticular user's user node in said repository rather than 
whether said client cache timestamp is not older than 
said last update timestamp; and 

said means for retrieving said CDI from said central data 
cache determines whether said CDI timestamp for said 
CDI is different from said last update timestamp cor- 
responding to said group node in said repository rather 
than whether said CDI timestamp is not older than said 
last update timestamp. 

15. A method for using a two-tiered cache for storing and 
accessing hierarchical data, comprising the steps of: 

providing a hierarchical structure of data comprising a 
top-level node, one or more intermediate levels having 
one or more intermediate-level nodes, and one or more 
user nodes, wherein each of said user nodes is a child 
node of said top-level node or one of said intermediate- 
level nodes, wherein said hierarchical structure is 
stored in a data repository accessible in said computing 
environment and wherein each of said nodes has a 
corresponding last update timestamp stored in said 
repository, said last update timestamp for said user 
nodes and said top-level node representing a last update 
to one or more data values of said node and said last 
update timestamp for said intermediate nodes repre- 
senting an update of data values of said intermediate 
node or a parent of said intermediate node; 
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creating coalesced data images (CDIs) for each of said 
top-level or intermediate-level nodes which is a group 
node, wherein a particular node is one of said group 
nodes when said particular node has one or more of said 
user nodes as a child, and wherein said CDI for said 
particular node comprises a coalescence of data values 
for said particular node, said top-level node, and all of 
said intermediate-level nodes in a hierarchical path 
from said particular node to said top-level node; 

storing said created CDIs in a central data cache along 
with a CDI timestamp for each of said stored CDIs 
wherein said CDI timestamp for each of said CDIs is 
set to said corresponding last update timestamp for said 
corresponding node; and 

storing user data for each of one or more users in a client 
cache for said user along with a client cache timestamp, 
wherein: each of said users is associated with a selected 
one of said user nodes; said client cache timestamp is 
set to said corresponding last update timestamp for said 
corresponding user node; and said stored user data is 
uncoalesced. 

16. The method according to claim 15, further comprising 
the step of: 

updating one or more of said data values for a selected 
node, further comprising the steps of: 
applying an update to said data values in said reposi- 
tory; 

updating said last update timestamp corresponding to 

said selected node; 
determining whether said selected node is one of said 

group nodes; and 
propagating said updated timestamp to each of said 

group nodes subordinate to said selected group node 

in said hierarchical structure when said determining 

step has a positive result. 

17. The method according to claim 16, further comprising 
the step of: 

retrieving a coalesced result in response to a request from 
a particular one of said users, wherein said request may 40 
specify a refreshed result or an unrefreshed result. 

18. The method according to claim 17, wherein said 
retrieving a refreshed result step further comprises the steps 
of: 

retrieving said user data for said particular user, further 45 
comprising the steps of: 

retrieving said user data from said client cache when 
(1) said client cache for said particular user exists 
and (2) said client cache timestamp for said particu- 
lar user's client cache is not older than said last 
update timestamp corresponding to said particular 
user's user node in said repository; and 

populating said particular user's client cache otherwise, 
wherein said populating further comprises the steps 
of: 

retrieving said user data from said particular user's 

user node in said repository; 
storing said retrieved user data in said client cache 

for said user; and 
setting said client cache timestamp for said user to 

said last update timestamp for said corresponding 

user node; 

retrieving said CDI for said group node of which said user 
node is one of said children, further comprising the 
steps of: 

retrieving said CDI from said central data cache when 
(1) said CDI for said group node exists and (2) said 
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CDI timestamp for said CDI is not older than said 

last update timestamp corresponding to said group 

node in said repository; and 
creating said CDI otherwise, wherein said creating 

further comprises the steps of: 

retrieving said data values from said repository for 
said group node, said top-level node, and all of 
said intermediate-level nodes in said hierarchical 
path from said group node to said top-level node; 

coalescing said retrieved data values; 

storing said coalesced data values as said CDI for 
said group node in said central data cache; and 

setting said CDI timestamp for said CDI to said last 
update timestamp for said corresponding group 
node; and 

merging said retrieved user data with said CDI of said 
parent node. 

19. The method according to claim 17, wherein said 
retrieving an unrefreshed result step further comprises the 
steps of: 

retrieving said user data for said particular user, further 
comprising the steps of: 

retrieving said user data from said client cache when 
said client cache for said particular user exists; and 

populating said particular user's client cache otherwise, 
wherein said populating step further comprises the 
steps of: 

retrieving said user data from said particular user's 

user node in said repository; 
storing said retrieved user data in said client cache 

for said user; and 
setting said client cache timestamp for said user to 

said last update timestamp for said corresponding 

user node; 

retrieving said CDI for said group node of which said user 
node is one of said children, further comprising the 
steps of: 

retrieving said CDI from said central data cache when 

said CDI for said group node exists; and 
creating said CDI otherwise, wherein said creating step 

further comprises the steps of: 

retrieving said data values from said repository for 
said group node, said top-level node, and all of 
said intermediate-level nodes in said hierarchical 
path from said group node to said top-level node; 

coalescing said retrieved data values; 

storing said coalesced data values as said CDI for 
said group node in said central data cache; and 

setting said CDI timestamp for said CDI to said last 
update timestamp for said corresponding group 
node; and 

merging said retrieved user data with a parent CDI 
associated with a parent node of said user node, 

20. The method according to claim 18, wherein said 
creating step further comprises the step of: 

repeating said creating step for each of said group nodes 
above said group node in said hierarchical path until 
reaching a first of said group nodes for which said CDI 
timestamp for said CDI of said first group node is not 
older than said last update timestamp corresponding to 
said group node in said repository. 

21. The method according to claim 18, wherein: 

said retrieving said user data from said client cache step 
determines whether said client cache timestamp for 
said particular user's client cache is different from said 
last update timestamp corresponding to said particular 
user's user node in said repository rather than whether 
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said client cache timestamp is not older than said last ing to said group node in said repository rather than 

update timestamp; and whether said CDI timestamp is not older than said last 

said retrieving said CDI from said central data cache step update timestamp. 
determines whether said CDI timestamp for said CDI is 

different from said last update timestamp correspond- ***** 
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[57] ABSTRACT 

A method of browsing the Worldwide Web of the Internet 
using an HTML-compliant client supporting a graphical user 
interface and a browser. The method begins as a web page 
is being displayed on the graphical user interface, the web 
page having at least one link to a hypertext document 
preferably located at a remote server. In response to the user 
clicking on the link, the link is activated by the browser to 
thereby request downloading of the hypertext document 
from the remote server to the graphical user interface of the 
client. While the client waits for a reply and/or as the 
hypertext document is being downloaded, the browser dis- 
plays one or more different types of informational messages 
to the user. Such messages include, for example, advertise- 
ments, notices, messages, copyright information and the 
like. 
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WEB BROWSER WITH DYNAMIC DISPLAY 
OF INFORMATION OBJECTS DURING 
LINKING 

TECHNICAL FIELD 5 

This invention relates generally to computer networks and 
more particularly to methods for enhancing the operation of 
a client browser operating in a multi-server computer envi- 
ronment. JQ 

BACKGROUND OF THE INVENTION 

The worldwide network of computers commonly known 
as the "Internet" has seen explosive growth in the last 
several years. Mainly, this growth has been fueled by the 15 
introduction and widespread use of so-called "web" brows- 
ers, which allow for simple graphical user interface (GUI)- 
based access to network servers, which support documents 
formatted as so-called "web pages". The "World Wide Web" 
(WWW) is that collection of servers of the Internet that 20 
utilize the Hypertext Transfer Protocol (HTTP). HTTP is a 
known application protocol that provides users access to 
files (which can be in different formats such as text, graph- 
ics, images, sound, video, etc.) using a standard page 
description language known as Hypertext Markup Language 25 
(HTML). HTML provides basic document formatting and 
allows the developer to specify "links" to other servers and 
files. Use of an HTML-compliant client browser involves 
specification of a link via a Uniform Resource Locator or 
"URL". Upon such specification, the client makes a tcp/ip 30 
request to the server identified in the link and receives a 
"web page" (namely, a document formatted according to 
HTML) in return. 

There is a finite time period between the time the user 
initiates the link and the return of the web page. Even when 35 
the web page is returned quickly, there is an additional time 
period during which formatting information must be pro- 
cessed for display on the display interface. For example, 
most web browsers display in-line images (namely images 
next to text) using an X bit map (XBM) or .gif format Each 40 
image takes time to process and slows downs the initial 
display of the document. The user typically "sees" an 
essentially unrecognizable "image" on the display screen 
which only gradually comes into focus. It is only after the 
entire image is downloaded from the server and then pro- 45 
cessed by the browser that the user can fully access the web 
page itself. This "waiting" period is even longer when the 
client machine has a relatively slow modem, and often the 
user will have to wait many seconds before being able to see 
the in-line image and/or begin using the web page. This 50 
problem will be exacerbated when the next generation 
browser technology (such as Netscape Navigator 2.0) 
becomes more widely implemented because such browsers 
are being designed to handle much more complex download 
formats (for more interactive, dynamic displays). 55 

BRIEF SUMMARY OF THE INVENTION 

It is thus a primary object of the invention to enhance the 
operation of a web browser by causing the display of some 60 
useful information to the user during the period of user 
"downtime" that otherwise occurs between linking and 
downloading of a hypertext document identified by the link. 
Such information may include, without limitation, adver- 
tisements, messages, fill-in forms, notices from a service 65 
provider, notices from another Internet service (such as 
receipt of an e-mail message), or some third party notice. 
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It is another more particular object of the invention to use 
an Hypertext Markup Language comment (e.g., via an 
JfTML comment tag) in a web page to store an information 
object related to a link and then formatting and displaying 
such information when the link is activated. 

It is still another object of the invention to embed an 
information object within an existing web page so that the 
object is masked until a link to another web page is acti- 
vated. Upon activation, the object is displayed to the user 
effectively as a "mini" web page while the browser calls the 
link and awaits for a reply and download. 

For example, in one particular embodiment, the informa- 
tion object includes copyright management information for 
a hypertext document associated with a link in a currently- 
displayed page. Such information may include the name or 
other identifying information of a copyright owner, terms 
and conditions for uses of the work within the hypertext 
document, and such other information as may be prescribed 
or desired. When the user "hits" the link in the current page, 
the copyright management information (which is already 
present in the browser) is displayed as the new document is 
being accessed and downloaded. The copyright management 
information, for example, may inform the user of the terms 
and conditions of how the copyrighted content being down- 
loaded can then be reused. The "time" period normally 
associated with the download is thus productive for both the 
user (since he or she no longer has to sit and wait for the 
display) as well as to the content provider. 

According to the preferred embodiment, there is 
described a method of browsing the Worldwide Web of the 
Internet using an HTML-compliant client supporting a 
graphical user interface and a browser. The method begins as 
a web page is being displayed on the graphical user inter- 
face, the web page having at least one link to a hypertext 
document preferably located at a remote server. In response 
to the user clicking on the link, the link is activated by. the 
browser to thereby request downloading of the hypertext 
document from the remote server to the graphical user 
interface of the client. While the client waits for a reply 
and/or as the hypertext document is being downloaded, the 
browser displays one or more different types of informa- 
tional messages to the user. Such messages include, without 
limitation, advertisements, notices, messages, fill-in forms, 
copyright information and the like. Preferably, the message 
information is in some way related to the hypertext docu- 
ment being accessed and downloaded, as in the case of 
copyright management information perhaps warning the 
user that the material being downloaded is subject to certain 
use restrictions of the copyright owner. Where the displayed 
information is related to the link, it is desirable that such 
information be embedded within the web page from which 
the link is launched. The information is preferably "hidden" 
within the web page using a hypertext markup comment tag. 

The invention is preferably implemented in a computer 
having a processor, an operating system, a graphical user 
interface and a HTTP-compliant browser. In such case, the 
novel and advantageous features of the invention are 
achieved using a first means, responsive to activation of a 
link from a web page, for retrieving an information object 
masked within the web page, and a second means for 
displaying information from the information object on the 
graphical user interface as the browser establishes the link. 
Preferably, the information object is masked by an HTML 
comment tag, which may include other HTML tags nested 
therein to format the information in the object. This enables 
the support of complex "mini" web pages that are displayed 
and accessible to the viewer during otherwise nonproductive 
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periods when the browser is busy processing links to other 
documents or web sites. 

The foregoing has outlined some of the more pertinent 
objects of the present invention. These objects should be 
construed to be merely illustrative of some of the more 5 
prominent features and applications of the invention. Many 
other beneficial results can be attained by applying the 
disclosed invention in a different manner or modifying the 
invention as will be described. Accordingly, other objects 
and a fuller understanding of the invention may be had by 10 
referring to the following Detailed Description of the pre- 
ferred embodiment 
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For a more complete understanding of the present inven- 
tion and the advantages thereof, reference should be made to 
the following Detailed Description taken in connection with 
the accompanying drawings in which: 

FIG. 1 illustrates a computer network in which the present 20 
invention is implemented; 

FIG. 2 illustrates a client computer supporting an HTML- 
compliant Worldwide Web browser; 

FIG. 3 is a flowchart diagram of a preferred method of the 
present invention for dynamic display of an information 25 
object during linking; 

FIG. 4 is a representative graphical user interface illus- 
trating browser navigation tools; 

FIG. 5 is a representative web page illustrating a hypertext 30 
link; 

FIG. 6 is a view of the HTML source code for the web 
page of FIG. 5; 

FIG. 7 is an example of a modified version of the HTML 
source code for the web page illustrated in FIG. 5, showing 35 
an information object embedded therein through a comment 
tag; and 

FIG. 8 is a representative screen display illustrating how 
the information object appears as a "mini" web page upon 
activation of the hypertext link in the web page of FIG. 5. 40 

DETAILED DESCRIPTION 

As represented in FIG. 1, the Internet is a known com- 
puter network based on the client-server model. Conceptu- 45 
ally, the Internet comprises a large network of "servers" 10 
which are accessible by "clients" 12, typically users of 
personal computers, through some private Internet access 
provider 14 (such as Internet America) or an on-line service 
provider 16 (such as America On-line, Prodigy, Com- 50 
puserve, the Microsoft Network, and the like). Each of the 
clients may run a browser," which is a known software tool 
used to access the servers via the access providers. A server 
10 operates a so-called *\veb site" which supports files in the 
form of documents and pages. A network path to a server is 55 
identified by a so-called Uniform Resource Locator or URL 
having a known syntax for defining a network connection. 

The "World Wide Web" (WWW) is that collection of 
servers of the Int ernet that utilize the Hypertext Transfer 
Protocol (HTTP). HTTP is a known application protocol that 60 
provides users access to files (which can be in different 
formats such as text, graphics, images, sound, video, etc.) 
using a standard page description language known as Hyper- 
text Markup Language (HTML). HTML provides basic 
document formatting and allows the developer to specify 65 
"links" to other servers and files. Use of an HTML-compli- 
ant client browser involves specification of a link via the 



URL. Upon such specification, the client makes a tcp/ip 
request to the server identified in the link and receives a 
"web page" (namely, a document formatted according to 
HTML) in return 

FIG. 2 shows a block diagram of a representative "client" 
computer in which the present invention is implemented. 
The system unit 21 includes a system bus or plurality of 
system buses 31 to which various components are coupled 
and by which communication between the various compo- 
nents is accomplished. The microprocessor 32 is connected 
to the system bus 31 and is supported by read only memory 
(ROM) 33 and random access memory (RAM) 34 also 
connected to system bus 31. The ROM 33 contains among 
other code the Basic Input-Output system (BIOS) which 
controls basic hardware operations such as the interaction 
and the disk drives and the keyboard. The RAM 34 is the 
main memory into which the operating system and applica- 
tion programs are loaded. The memory management chip 35 
is connected to the system bus 31 and controls direct 
memory access operations including, passing data between 
the RAM 34 and hard disk drive 36 and floppy disk drive 37. 
The CD ROM 42, also coupled to the system bus 131, is 
used to store a large amount of data, e.g., a multimedia 
program or large database. 

Also connected to this system bus 31 are various I/O 
controllers: the keyboard controller 38, the mouse controller 
39, the video controller 40, and the audio controller 41. The 
keyboard controller 38 provides the hardware interface for 
the keyboard 22, the controller 39 provides the hardware 
interface for the mouse (or other point and click device) 23, 
the video controller 40 is the hardware interface for the 
display 24, and the audio controller 41 is the hardware 
interface for the multimedia speakers 25a and 25b. A modem 
50 enables communication over a network 56 to other 
computers over the computer network. 

The operating system 60 of the computer may be DOS, 
WINDOWS 3.x, WINDOWS '95, OS/2, AJX, or any other 
known and available operating system, and each computer is 
sometimes referred to as a machine. RAM 34 also supports 
a num ber of Internet access tools including, for example, the 
HTTP-compliant web browser 62. Known browser software 
includes Netscape, Netscape Navigator 2.0, Mosaic, and the 
like. The present invention is designed to operate within any 
of these known or developing web browsers, which are 
preferably modified as described herein to achieve the 
dynamic display of information objects during web site 
linking activities. RAM 34 may also support other Internet 
services including simple mail transfer protocol (SMTP) or 
e-mail, file transfer protocol (FTP), network news transfer 
protocol (NNTP) or "Usenet", and remote terminal access 
(Telnet). 

Hyperlext Markup Language uses so-called "tags," 
denoted by the o symbols, with the actual tag between the 
brackets. Most tags have a beginning (<tag>) and an ending 
section, with the end shown by the slash symbol (</tag>). 
There are numerous link tags in HTML to enable the viewer 
of the document to jump to another place in the same 
document, to jump to the top of another document, to jump 
to a specific place in another document, or to create and 
jump to a remote link (via a new URL) to another server. 
Links are typically displayed on a web page in color or with 
an underscore. In response to the user pointing and clicking 
on the link, the link is said to be "activated" to begin the 
download of the linked document or text. For more details 
on HTML, the reader is directed to the HTML Reference 
Manual, published by Sandia National Laboratories, and 
available on the Internet at "http://www.sandia.gov/sci_ 
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compute/html Jef.html" or the HTML Quick Reference, pub- 
lished by the University of Kansas, and available on the 
Internet at "htrp://kuhttp.cc.u.kans.edu/lynx_help/HTML_ 
quick.htmT. Each of these publications are incorporated 
herein by reference. 5 

A known HTML tag is a "comment " which typically 
allows a web page developer to include text that is to be 
ignored by the browser. The syntax for a "comment" tag is 
denoted <!--text-->. HTML is an evolving language. Recent 
standards for new versions of this language propose to add 10 
SGML comment syntax to HTML elements. This proposal 
would begin a comment with a double dash encountered 
inside any HTML element (but no inside quotes), and treat 
every thing as comments (including any " ",*' or quote 
character) until the next occurring double dash. Such syntax 
allows HTML elements within a comment. 15 

According to the present invention, an information 
"object" is preferably placed within a comment tag of a web 
page and thus is "ignored" by the browser in the formatting 
of the document then being displayed. This information 2Q 
object, however, is also saved to a separate file or cache 
within the client. A particular web page may have multiple 
information objects, with one or more objects associated 
with one or more links in the documents. Thus, for example, 
if the document has two links, one information object is ^ 
associated with the first link and a second information object 
is associated with a second link, and so on. Or, multiple 
information objects may be associated with a single link. Or, 
the information object(s) may have no direct relation to the 
content of any link in the document. While in the preferred 30 
embodiment an HTML "comment" tag is used to mask the 
information object, those skilled in the art will recognize that 
other HTML commands and tags may be used for this 
purpose as well including, for example, tag dedicated to 
masking an information object within the currently-dis- 35 
played page. For example, an information object may be 
hidden within a clickable image identified with an ismap tag. 
Also, an information object may be formatted as a "mini" 
web page by nesting HTML elements within a particular 
HTML comment tag. 

40 

. As noted above, a web browser 62 running on the client 
uses a TCP/IP connection to pass a request to a web server 
running a HTTP "service" (under the WINDOWS operating 
system) or "daemon" (under the UNIX operating system). 
The HTTP service then responds to the request, typically by 45 
sending a "web page" formatted in the Hypertext Markup 
Language, or HTML, to the browser. The browser then 
displays the web page using local resources (e.g., fonts and 
colors). 

A preferred operation of the inventive method is illus- 50 
trated in the flowchart of FIG. 3. The method begins at step 
70 as a current web page is being displayed on the graphical 
user interface of the computer. It is assumed that this web 
page has embedded therein one or more comment tags, each 
of which (or perhaps several of which in combination) 55 
define an information object. Generally, although not 
required, each information object will be provided for one or 
more links in the web page being displayed. However, 
because the information object is embedded within a com- 
ment tag, it is hidden or "masked" and thus is ignored by the 60 
display routines of the browser. In step 72, the method saves 
or stores the information object in memory or some dedi- 
cated portion of the RAM (e.g., a cache) so that it may be 
easily and quickly obtained. At step 74, a test is made to 
determine whether a link associated with the information 65 
object has been activated. If so, the method continues at step 
76 and issues a tcp/ip request to the network (assuming the 



link was to a. URL). Step 78 represents the handshaking 
period during which the client waits for the appropriate 
response from the server. During this period, the client 
retrieves the information object (at step 80) and outputs the 
information (in step 82) to the user on the display. Steps 80 
and 82 are shown in parallel to the handshaking and wait 
step 78 to emphasize the inventive concept of displaying 
useful information to the viewer during the link process. At 
step 84, a test is then performed to determine whether the 
download and refresh of the display is complete. If so, the 
routine saves the information object at step 86 and opens up 
access to the hypertext document at step 88. 

FIG. 4 shows the browser navigation tool prior to down- 
load of the U.S. Patent and Trademark Office page (available 
at http://www.uspto.gov), FIG. 5 shows the web page as it 
exists on the display. This web page has various links 
including "Welcome to the United States Patent and Trade- 
mark Office." FIG. 6 shows the HTML source code used to 
generate the web page of FIG. 5, and FIG. 7 shows this 
source code modified to include an information object 75 
within a comment tag. This object displays the message 
'The PTO Welcomes You" when the "Welcome to the 
United States Patent and Trademark Office" link is activated. 
FIG. 8 shows the effect of this information object when the 
routine of FIG. 3 is carried out 

Although the invention has been described in terms of a 
preferred embodiment, those skilled in the art will recognize 
that various modifications of the invention can be practiced 
within the spirit and scope of the appended claims. Thus, for 
example, the information supplied to the user during the 
period between link activation and downloading of the 
hypertext document need not be merely a visual output It is 
also envisioned that some or all parts of a particular message 
be conveyed to the user aurally (via a multimedia speaker 
set, for example) as well as on the display screen. The 
message itself may be retained on the screen as an inline 
image or other text along with the downloaded hypertext 
document, and the browser includes appropriate means to 
queue the message to print and/or to save the message or 
allow the user to compose a response to the message. One 
such technique for responding to the message uses the 
HTML "fill-in" form tags. The browser may be suitably 
programmed to queue the mini web page for background 
printing whenever the link is activated. 

Moreover, although in the preferred embodiment it is 
envisioned that the hypertext document (associated with the 
web page link) is located on a remote server, this is not a 
limitation of the invention. The display of informational 
messages may be effected whenever a link is activated, 
regardless of the location of the target document. Also, while 
the preferred embodiment has been described in the context 
of an Internet browser, the techniques of the invention apply 
whether or not the user accesses the Worldwide Web via a 
direct Internet connection (namely using an Internet access 
provider) or indirectly through some on-line service pro- 
vider (such as America- On-Line, Prodigy, Compuserve, the 
Microsoft Network, or the like). Thus the "computer net- 
work" in which the invention is implemented should be 
broadly construed to include any client-server model from 
which a client can link to a "remote" document (even if that 
document is available on the same machine or system). 

It should also be appreciated that while in the preferred 
embodiment the information object is formatted and dis- 
played upon activation of a link in a web page being 
currently displayed, this is not a limitation of the invention 
either. The information object need not be embedded within 
an existing web page, but rather may be embedded within 
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the home page of the browser or supported elsewhere within 
the client itself. Thus, the information object may be dis- 
played whenever a call to a web page is made, such as when 
a search to a particular URL is initiated, or when a previ- 
ously-stored URL is launched (e.g., from a "View Book- 5 
mark" pulldown menu). Moreover, the client may store 
random information objects in the form of information 
advertisements (which in turn may include .gif files to 
produce images) so that the browser may call any such 
information at random. The browser may even be pro- 10 
grammed to select which of the plurality of information 
objects to display based on a comparison of the type of web 
pages accessed by the user. Thus, for example, if the user 
accesses web pages relating to a particular service, the 
browser may be programmed to identify this access history 15 
and select predetermined information objects that will be of 
interest to the user (given that history). 

The information objects may themselves be HTML "fill- 
in" forms that are retained on the display screen and may be 
filled in with information that the browser can then deliver 20 
back to some third party service provider. This enables the 
information objects to be used as mini survey forms for 
interactive, on-line surveys and the like, which would be 
especially advantageous for web site providers or other third 
parties. 25 

As noted above, the information object may be automati- 
cally or selectively queued to the client printer upon display. 
This would enable the viewer to generate merchandise 
coupons and the like related to the web page being accessed. 
Thus the web site provider could offer the viewer some 30 
added incentive for accessing its web page by causing the 
printing of a redeemable coupon or other information token 
(e.g., a discount card, a receipt, etc). All of these actions are 
initiated during the otherwise downtime between web page 
access and download, thereby significantly increasing the 35 
value of the on-line informational content provided to the 
user. 

As used herein, the "information object' ' or "information" 
output to the viewer during the link process should be ^ 
broadly construed to cover any and all forms of messages, 
notices, text, graphics, sound, video, tables, diagrams, 
applets and other content, and combinations of any of the 
above. 

One of the preferred implementations of the "browser" of 45 
the invention is as a set of instructions in a code module 
resident in the random access memory of the user's personal 
computer. Until required by the computer, the set of instruc- 
tions may be stored in another computer memory, for 
example, in a hard disk drive, or in a removable memory 50 
such as an optical disk (for eventual use in a CD ROM) or 
floppy disk (for eventual use in a floppy disk drive). In 
addition, although the various methods described are con- 
veniently implemented in a general purpose computer selec- 
tively activated or reconfigured by software, one of ordinary 55 
skill in the art would also recognize that such methods may 
be carried out in hardware, in firmware, or in more special- 
ized apparatus constructed to perform the required method 
steps. 

Finally, the present invention is designed to be imple- 60 
merited with conventional HTML and enhancements thereto 
(including HTML 2.0, HTML 3.0, HTML with third party- 
supplied extensions such as NHTML, and the like), by an 
HTML-compliant browser, such as Netscape, Netscape 
Navigator 2.0, Mosaic, MSN, as such existing or developed 65 
programs are modified to include the functionality of the 
invention described above. Netscape Navigator 2.0 has 
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in-line support for platform-independent application objects 
(e.g. t applets written in JavaScript, from Sun Microsystems). 
An applet resides on the server associated with a web page 
and is downloaded to the client browser after a link is 
established to the web page. The browser includes an engine 
for executing the downloaded applets. With this type of 
browser, the invention caches or otherwise stores a down- 
loaded applet and later uses it, preferably when a new, 
related link is established. Thus, an information object" 
according to the invention may include an applet which, for 
example, may generate an animated figure or icon, some 
aural output, a scrolling display, or a combination thereof. 
One of ordinary skill, however, will recognize that the 
inventive features of the invention, including the masking of 
"mini" hypertext documents within a web page and display 
of such documents upon link activation, may be applied to 
other Internet services as well as to HTTP compliant brows- 
ers. Thus, the invention would be useful to provide infor- 
mation to a user during an FTP access, an on-line chat, a 
posting to a bulletin board, or even during the sending and 
retrieval of e-mail. All such variations are considered within 
the scope of the invention. 

Having thus described my invention, what I claim as new 
and desire to secure by Letters Patent is set forth in the 
following claims. 

What is claimed is: 

1. A method of browsing in a computer network having at 
least one client connectable to one or more servers, the client 
having an interface for displaying a first hypertext document 
with at least one link to a second hypertext document located 
at a server, comprising the steps of; 

contacting the server in response to activation of the link 
to initiate downloading of the second hypertext docu- 
ment from the server to the client; 

in response to activation of the link, retrieving an infor- 
mation object that has been stored in the client prior to 
the activation of the link; and 

displaying the retrieved information object on the inter- 
face during at least a portion of a time period between 
the activation of the link and completion of the down- 
loading of the second hypertext document from the 
server to the client to provide information to a user 
during a process of linking from the first hypertext 
document to the second hypertext document; 

wherein the information object is stored within the first 
hypertext document and is not displayed on the inter- 
face until after activation of the link that initiates 
downloading of the second hypertext document from 
the server to the client. 

2. The method of browsing of claim 1 wherein the 
information object is stored within the first hypertext docu- 
ment using a markup tag that normally specifies a comment 
within a hypertext document. 

3. The method of browsing of claim 2 wherein the markup 
tag includes nested HTML tags to format the information 
object within the markup tag. 

4. A method of browsing in a computer network having at 
least one client connectable to one or more servers, the client 
having an interface for displaying a first hypertext document 
with at least one link to a second hypertext document located 
at a server, comprising the steps of: 

contacting the server in response to activation of the link 
to initiate downloading of the second hypertext docu- 
ment from the server to the client; 

in response to activation of the link, retrieving an infor- 
mation object that has been stored in the client prior to 
the activation of the link; and 
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displaying the retrieved information object on the inter- 
face during at least a portion of a time period between 
the activation of the link and completion of the down- 
loading of the second hypertext document from the 
server to the client to provide information to a user 5 
during a process of linking from the first hypertext 
document to the second hypertext document; 

wherein the information object includes copyright man- 
agement information for the second hypertext docu- 
ment. 10 

5. The method of browsing of claim 4 wherein the 
copyright management information includes terms and con- 
ditions for use of a copyrighted work being downloaded 
from the server to the client. 

6. A method of browsing in a computer network having at 1 5 
least one client connectable to one or more servers, the client 
having an interface for displaying a first hypertext document 
with at least one link to a second hypertext document located 

at a server, comprising the steps of: 

contacting the server in response to activation of the link 20 
to initiate downloading of the second hypertext docu- 
ment from the server to the client; 

in response to activation of the link, retrieving an infor- 
mation object that has been stored in the client prior to 25 
the activation of the link; and 

displaying the retrieved information object on the inter- 
face during at least a portion of a time period between 
the activation of the link and completion of the down- 
loading of the second hypertext document from the 30 
server to the client to provide information to a user 
during a process of linking from the first hypertext 
document to the second hypertext document; 

wherein the information object includes an advertisement. 

7. The method of browsing of claim 6 wherein the 35 
advertisement advertises goods or services described in the 
second hypertext document 

8. The method of browsing of claim 7 further including 
the step of queueing the information object to a printer 
associated with the client to thereby print a coupon for the 40 
goods or services. 

9. A method of browsing the World Wide Web of the 
Internet using a client machine supporting a browser, com- 
prising the steps of: 

storing an information object; 45 

activating a link from a first hypertext document to initiate 
downloading of a second hypertext document; 

retrieving the stored information object in response to 
activating the link; and 5Q 

displaying the information object during at least a portion 
of a time period between activation of the link and 
completion of the downloading of the second hypertext 
document to provide information to a user of the client 
machine as the browser links from the first hypertext 55 
• document to the second hypertext document; 

wherein the information object is stored within the first 
hypertext document and is not displayed until after 
activating the link that initiates downloading of the 
second hypertext document. 60 

10. The method of browsing of claim 9 further including 
the step of queueing the imfonnation object to a printer. 

11. The method of browsing of claim 10 wherein the 
information object is an advertisement. 

12. In a computer having a processor, an operating 65 
system, a graphical user interface and a browser, the 
improvement comprising: 



means, responsive to activation of a link from a web page 
that initiates downloading of a linked hypertext docu- 
ment, for retrieving information masked within the web 
page; and 

means for outputting the information during at least a 
portion of a time period between activation of the link 
and completion of the downloading of the linked hyper- 
text document to provide information a user of the 
computer as the browser links from the web page to the 
linked hypertext document. 

13. In the computer system as described in claim 12 
wherein the information is outputted on the graphical user 
interface.- 

14. In the computer system as described in claim 12 
wherein the information is masked by an HTML comment 
tag. 

15. In the computer system as described in claim 12 
wherein the HTML comment tag includes HTML tags 
nested therein to format the information. 

16. In a computer having a browser for retrieving hyper- 
text objects from servers in a computer network, the browser 
including an engine for executing applets, the improvement 
comprising: 

means, responsive to activation of a link from a hypertext 
object that initiates downloading of a linked hypertext 
object, for retrieving an applet downloaded to the 
computer prior to activation of the link; and 

means for outputting the applet during at least a portion of 
a time period between activation of the link and 
completion of the downloading of the linked hypertext 
object to thereby provide information to a user of the 
computer as the browser Jinks from the hypertext object 
to the linked hypertext object. 

17. In a computer having a browser for retrieving hyper- 
text objects from servers in a computer network, the 
improvement comprising: 

means, responsive to activation of a link from a hypertext 
object that initiates downloading of a linked hypertext 
object, for retrieving an information object downloaded 
to the computer prior to activation of the link; and 

means for outputting the information object during at least 
a portion of a time period between activation of the link 
and completion of the downloading of the linked hyper- 
text object to thereby provide information to a user of 
the computer as the browser links from the hypertext 
object to the linked hypertext object: 

wherein the information object is an advertisement. 

18. The computer as described in claim 17 wherein the 
advertisement is selected based on the link. 

19. A method of browsing in a computer network having 
a plurality of servers that support hypertext objects, com- 
prising the steps of: 

storing a plurality of information objects at a client 
machine having a browser, wherein the information 
objects are downloaded to the client machine from the 
computer network; 

in response to activation of a link that initiates download- 
ing of a hypertext object, selectively displaying one of 
the stored information objects during at least a portion 
of a time period between activation of the link and 
completion of the downloading of the hypertext object 
to provide information to a user of the client machine 
during a process of linking to the hypertext object; 

wherein the information objects are advertisements for 
goods or services. 
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